FIG. 16



Key: 

HO sequence; RAD54 promoter; yEGFP; KanMX sequence; AmpR; REP 2µ sequence; URA3

GATCCAAGCTATCTACTGAGATTTCTGGCTCTTTTGTTGTACTGTCACCTAACCACAGACCAAGCATCCAA 

GCCATACTTTTTACAGCAGGAGTTACAAGGTCACTACGTCCAGTGAGAAATTTAGATAAAACACCATTTCC 

TGCGAGTACTGGACCAAATCTTATGCAGCTAGAAATTCTCAATTGAGCATCAAGATAATCCAAATCTCTAA 

CTTCAATGTCAAAGTTGAAATATTCTCCTTTAGAGCGCTCCATTTCTTCTATGAAGCGTTTTGCGGCAAAC

TCACCTTCAACTGTCATTGGGAATGTCTTATGATGGTTTTTTGGAATTATTATTATCCTACCATCAAGCGT 

CTGACATTGCTGCAGATTTCTCCATCTCACTTTATATTTGGTGGCATTTCTACCACTTTTTTCCAACAGTG 

GTTTGGTAGGGACCCTGACTGACAATTTATGACCTGCAGTACATTGTAATGCAAGACGCTGATAAACTGTT 

CTACGCCTGGGATCTAACCTACCAGGTTCACCTTCAAAAGCTCTGTGTTTGGTTTTTTGCTGTATATTATA 

GATTTTCTGATAGCCCTGTGTGACATTTATGACGCGGGCAGCGGAGCCATCTGCGCACATAACGTAAGAGT 

TAGCCGTGACGTTTGCGATGTCTTTAATTTCACCGTTAGCCATCAGAATAGTCGTGTTTTCAGAAAGCATT 

TTGATCCGACATACGATGACCTCAATGATTTAGATTATGTGTTGCACTTTTATAGACCTACCAAAAATCCA

GTGCGTACACTAATACTTTCATAAAGATACCTGAAACAATAACCAGAAAGATCGGCAAAAAAATTTTTTTT

CTTTGCCGAGATCACAAACCTAGTATGACGAAAAAGCTTGAAGTTTAGATGAGTAAGGAAAATACAAGTGA

CGCTTTTATATGGTGCAAGGAACAAAAACTAAAAACAACAAGGCAAATGTGGATCTGTCATGTATGGCAAC

GACAGCAGGATGGCTCACAAAAAAAGACAAAAAAAACTAAGGCAAAAGAACAAAGCTCCTCTCCTGCTCAA 

GAAACGTATTGTTGAAAAACCACCGTCGTAAGAAAGTTTTTCTGTGACCTATAATGGTTTAAAATCGGCCC

ATTTTTTTTCCCTCTTTTGTGGTCCAGTCTTTCTCATACTCGAGGGAAATTCGAGACAAACAGCGGAGAAG 

XGTGGCTAAACCGGCAAGTGCCTGCAAGATCCACAGAACTAACCGCACGAACTGGCGGTCAGAAAAGAGCC 

TGTTCCGGAAAGAGAGAAACAGAGAAACGATCATGATGGGAAAGCGGGGATTCGGCGAAGAACGAGACTGG

AAAGGGAAAAAGAGAAATACTGGTGGAAGTATTCGGACCTTTGGCGAAGTCCGAACCCTTGAAACCCAAAG

ATGATCGATGATTCATTTTTCAATGCGCTACGGTTCCTGCCGCTCGTGGGAACCCCACGCAAAACATATTA 

TTCGCTTCTCTCTGCTGACAACTCCGGTTTACGTTATACCGTATTAGGATCACTATAAGGGTTCCTTCGGG 

AGGAGGGGGGAGGGGAAGAATGTACATCGTCATAAGGCCTTTATGGTGTGAAGTGGGTTTTGCGTGGAAAA 

TTCGTTTTCAATGATATAGAGCCCACGCATATACGTACATACTAGTGGCCAAAAGCGTGGGGTGGGCGGAC 

AAAGCTACACTGGTAAAATAGAGGATTCTATGAACAATAACAACAACCAGCTCACGTTGCTGAACAGCCGA

GGTCAGCCGATGCAACCGAGGTTTCCAAAGTAGCATTTCTGTGCTAGCTATGTCTGTAGGTTTACATTTAA 

TGGTGCGTGGTTCCAGCTTCATGTGCTTGCATGTGATGTCCTGCAGATGGTAAGAAGATTCTGAAAGCCGC

GCTAGGAGAAAAATATTCTGCTCGAAGATCTGTCCTCTTAAGTAGAAAGCGTGAAATTGTTGCGTTCTTGC 

ATTACTACTCAACGCGTACGCAAATGCGTCTACTGCACCTGCATGATAAAGCTTATGTATCAAAAATTTAA

CATCTTGAAAATACACAAGTGGTGCAAAGATGTGTCACGTTCTGGACCTGAGTGGTGCCATGTATGCTATT

TAACATGCAAAGGGGAAGACCCTTCCGCCTTACTGCAATAATAAAAAGTATTTTACGCGTTACCCAATATA 

GCAAAGTTTCGCGCAAAAAAAAAAATAAAAAACAATTACAAACAAAAAGAAAAAAAAGGAAATAATAGAAG

ATCTAACTGAAGCGAAGGCCAAAACTCTTCTCACTTGACGTAATAGCCGATACAAAATCTAGAGCAGCAAC

TTTTCTC^^TCTTCACTAAAGCTGCTACGAAAGTATAGAAAAATCAAACGCTCAGAACTTAGCTCTATTTC

AAGGTACCATATATATTTCCTTATAACTGATGTTAATTAACTCTAAAGGTGAAGAATTATTCACTGGTGTT

GTCCCAATTTTGGTTGAATTAGATGGTGATGTTAATGGTCACAAATTTTCTGTCTCCGGTGAAGGTGAAGG 

TGATGCTACTTACGGTAAATTGACCTTAAAATTTATTTGTACTACTGGTAAATTGCCAGTTCCATGGCCAA 

CCTTAGTCACTACTTTCGGTTATGGTGTTCAATGTTTTGCGAGATACCCAGATCATATGAAACAACATGAC 

TTTTTCAAGTCTGCCATGCCAGAAGGTTATGTTCAAGAAAGAACTATTTTTTTCAAAGATGACGGTAACTA 

CAAGACCAGAGCTGAAGTCAAGTTTGAAGGTGATACCTTAGTTAATAGAATCGAATTAAAAGGTATTGATT 

TTAAAGAAGATGGTAACATTTTAGGTCACAAATTGGAATACAACTATAACTCTCACAATGTTTACATCATG 

GCTGACAAACAAAAGAATGGTATCAAAGTTAACTTCAAAATTAGACACAACATTGAAGATGGTTCTGTTCA 

ATTAGCTGACCATTATCAACAAAATACTCCAATTGGTGATGGTCCAGTCTTGTTACCAGACAACCATTACT 

TATCCACTCAATCTGCCTTATCCAAAGATCCAAACGAAAAGAGAGACCACATGGTCTTGTTAGAATTTGTT 
FIG. 16 continued

ACTGCTGCTGGTATTACCCATGGTATGGATGAATTGTACAAATAACTGCAGGGCGCGCCACTTCTAAATAA
GCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAAAATAAGTGTATACAAATTTTA
AAGTGACTCTTAGGTTTTAAAACGAAAATTCTTATTCTTGAGTAACTCTTTCCTGTAGGTCAGGTTGCTTT
CTCAGGTATAGTATGAGGTCGCTCTTATTGACCACACCTCTACCGGCAGATCCGCTAGGGATAACAGGGTA 
ATATAGATCTGCCCGCCGGGAAGGCGAACCCGATCGGATGCATCCTCTCTGCTGCCATGATGCTGAAGTTG 
TCGTTGAACATGGTTGCTGCCGGCGAGGCGGTCGAGCAGGCAGTGCAGGAGGTGTTGGACTCGGGAGTCAG 
AACGGGCGACCTGCTCGGCTCGAGCTCCACTTCGGAGGTTGGCGACGCCATTGCGCTTGCAGTTAAGGAAG
CCTTGCGCAGGCAATCCGCAGCTGGTCTGAGCTAGCCTCGAGGACCCTTCTCTTTAGACTATTCTACTCTT
ATGCACGTAAAAAATTCTAGGAAATATGTATTAACTAGGAGTAAAATAACCGGCTAGTGGCATTCATATAG
CCGTCTGTTTACATCTACATCACACATTTCGAGTGTATATCTCGCAACGTTGGCGTTAAATAGGCAGTCAA 
TGGCCCGACCATTCTATGGTGTTTAGGTCGATGCCATCTTTGTACGTTTAGCTTATCGATGATAAGCTGTC 
AAACATGAGAATTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAAT 
AATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCT 
AAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGG
AAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTT 
TGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCG
AACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACT 
TTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCGCAT 
ACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAG 
TAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATC 
GGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGA 
ACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGT 
TGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCG 
GATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGC
CGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTA 
TCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTG 
ATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTA
ATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGT 
TCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATC
TGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCT 
TTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAG 
GCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCT
GCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTC 
GGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTAC
AGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGG 
GTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTT
TCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCA 
GCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCC 
CCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGA 
GCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCG 
GTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATACA 
CTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGA 
CGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTCAGAG 
GTTTTCACCGTCATCACCGAAACGCGCGAGGCAGAGCTTTGAAGAAAAATGCGCCTTATTCAATCTTTGCT 
ATAAAAAATGGCCCAAAATCTCACATTGGAAGACATTTGATGACCTCATTTCTTTCAATGAAGGGCCTAAC 
GGAGTTGACTAATGTTGTGGGAAATTGGAGCGATAAGCGTGCTTCTGCCGTGGCCAGGACAACGTATACTC 
ATCAGATAACAGCAATACCTGATCACTACTTCGCACTAGTTTCTCGGTACTATGCATATGATCCAATATCA 
AAGGAAATGATAGCATTGAAGGATGAGACTAATCCAATTGAGGAGTGGCAGCATATAGAACAGCTAAAGGG 
TAGTGCTGAAGGAAGCATACGATACCCCGCATGGAATGGGATAATATCACAGGAGGTACTAGACTACCTTT 
CATCCTACATAAATAGACGCATATAAGTACGCATTTAAGCATAAACACGCACTATGCCGTTCTTCTCATGT
ATATATATATACAGGCAACACGCAGATATAGGTGCGACGTGAACAGTGAGCTGTATGTGCGCAGCTCGCGT
TGCATTTTCGGAAGCGCTCGTTTTCGGAAACGCTTTGAAGTTCCTATTCCGAAGTTCCTATTCTCTAGAAA
GTATAGGAACTTCAGAGCGCTTTTGAAAACCAAAAGCGCTCTGAAGACGCACTTTCAAAAAACCAAAAACG
CACCGGACTGTAACGAGCTACTAAAATATTGCGAATACCGCTTCCACAAACATTGCTCAAAAGTATCTCTT 
TGCTATATATCTCTGTGCTATATCCCTATATAACCTACCCATCCACCTTTCGCTCCTTGAACTTGCATCTA 
AACTCGACCTCTACATTTTTTATGTTTATCTCTAGTATTACTCTTTAGACAAAAAAATTGTAGTAAGAACT 
ATTCATAGAGTGAATCGAAAACAATACGAAAATGTAAACATTTCCTATACGTAGTATATAGAGACAAAATA
GAAGAAACCGTTCATAATTTTCTGACCAATGAAGAATCATCAACGCTATCACTTTCTGTTCACAAAGTATG
CGCAATCCACATCGGTATAGAATATAATCGGGGATGCCTTTATCTTGAAAAAATGCACCCGCAGCTTCGCT 
AGTAATCAGTAAACGCGGGAAGTGGAGTCAGGCTTTTTTTATGGAAGAGAAAATAGACACCAAAGTAGCCT 
TCTTCTAACCTTAACGGACCTACAGTGCAAAAAGTTATCAAGAGACTGCATTATAGAGCGCACAAAGGAGA 
AAAAAAGTAATCTAAGATGCTTTGTTAGAAAAATAGCGCTCTCGGGATGCATTTTTGTAGAACAAAAAAGA 
AGTATAGATTCTTTGTTGGTAAAATAGCGCTCTCGCGTTGCATTTCTGTTCTGTAAAAATGCAGCTCAGAT 
TCTTTGTTTGAAAAATTAGCGCTCTCGCGTTGCATTTTTGTTTTACAAAAATGAAGCACAGATTCTTCGTT 
GGTAAAATAGCGCTTTCGCGTTGCATTTCTGTTCTGTAAAAATGCAGCTCAGATTCTTTGTTTGAAAAATT
AGCGCTCTCGCGTTGCATTTTTGTTCTACAAAATGAAGCACAGATGCTTCGTTCTGCGGTAAAGCTCATCA
GCGTGGTCGTGAAGCGATTCACAGATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAG 
CGTTAATGTCTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTGATGCCT 
CCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGAGGATGCTCACGATACGG 
GTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGTGAGGGTAAACAACTGGCGGTATGGATGCGGCG 
GGACCAGAGAAAAATCACTCAGGGTCAATGCCAGCGCTTCGTTAATACAGATGTAGGTGTTCCACAGGGTA 
GCCAGCAGCATCCTGCGATGCAGATCCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACTT 
TACGAAACACGGAAACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGCAGTCGCT 
TCACGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTAAGGCAACCCCGCCAGCCTAGCCGGGTCC 
TCAACGACAGGAGCACGATCATGCGCACCCGTGGCCAGGACCCAACGCTGCGGGGGGGGGGGGGGTTTTCT 
TTCCAATTTTTTTTTTTTCGTCATTATAGAAATCATTACGACCGAGATTCCCGGGTAATAACTGATATAAT
TAAATTGAAGCTCTAATTTGTGAGTTTAGTATACATGCATTTACTTATAATACAGTTTTTTAGTTTTGCTG 
GCCGCATCTTCTCAAATATGCTTCCCAGCCTGCTTTTCTGTAACGTTCACCCTCTACCTTAGCATCCCTTC
CCTTTGCAAATAGTCCTCTTCCAACAATAATAATGTCAGATCCTGTAGAGACCACATCATCCACGGTTCTA
TACTGTTGACCCAATGCGTCTCCCTTGTCATCTAAACCCACACCGGGTGTCATAATCAACCAATCGTAACC
TTCATCTCTTCCACCCATGTCTCTTTGAGCAATAAAGCCGATAACAAAATCTTTGTCGCTCTTCGCAATGT
CAACAGTACCCTTAGTATATTCTCCAGTAGCTAGGGAGCCCTTGCATGACAATTCTGCTAACATCAAAAGG
CCTCTAGGTTCCTTTGTTACTTCTTCCGCCGCCTGCTTCAAACCGCTAACAATACCTGGGCCCACCACACC
GTGTGCATTCGTAATGTCTGCCCATTCTGCTATTCTGTATACACCCGCAGAGTACTGCAATTTGACTGTAT
TACCAATGTCAGCAAATTTTCTGTCTTCGAAGAGTAAAAAATTGTACTTGGCGGATAATGCCTTTAGCGGC 
TTAACTGTGCCCTCCATGGAAAAATCAGTCAAGATATCCACATGTGTTTTTAGTAAACAAATTTTGGGACC
TAATGCTTCAACTAACTCCAGTAATTCCTTGGTGGTACGAACATCCAATGAAGCACACAAGTTTGTTTGCT 
TTTCGTGCATGATATTAAATAGCTTGGCAGCAACAGGACTAGGATGAGTAGCAGCACGTTCCTTATATGTA 
GCTTTCGACATGATTTATCTTCGTTTCCTGCAGGTTTTTGTTGTGTGCAGTTGGGTTAAGAATACTGGGCA 
ATTTCATGTTTCTTCAACACCACATATGCGTATATATACCAATCTAAGTCTGTGCTCCTTCCTTCGTTCTT
CCTTCTGCTCGGAGATTACCGAATCAAAAAAATTTCAAAGAAACCGGAATCAAAAAAAAGAACAAAAAAAA
AAAAGATGAATTGAAACCCCCCCCCCCCCCGATGCGCCGCGTGCGGCTGCTGGAGATGGCGGACGCGATGG 
ATATGTTCTGCCAAGGGTTGGTTTGCGCATTCACAGTTCTCCGCAAGAATTGATTGGCTCCAATTCTTGGA 
GTGGTGAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGGCCCGGCTCCATGCACCGCGA 
CGCAACGCGGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATGCCAACCCGTTCCATGTGCTCGC 
CGAGGCGGCATAAATCGCCGTGACGATCAGCGGTCCAGTGATCGAAGTTAGGCTGGTAAGAGCCGCGAGCG 
ATCCTTGAAGCTGTCCCTGATGGTCGTCATCTACCTGCCTGGACAGCATGGCCTGCAACGCGGGCATCCCG 
ATGCCGCCGGAAGCGAGAAGAATCATAATGGGGAAGGCCATCCAGCCTCGCGTCGCGAACGCCAGCAAGAC 
GTAGCCCAGCGCGTCGGCCGCCATGCCGGCGATAATGGCCTGCTTCTCGCCGAAACGTTTGGTGGCGGGAC 
CAGTGACGAAGGCTTGAGCGAGGGCGTGCAAGATTCCGAATACCGCAAGCGACAGGCCGATCATCGTCGCG 
CTCCAGCGAAAGCGGTCCTCGCCGAAAATGACCCAGAGCGCTGCCGGCACCTGTCCTACGAGTTGCATGAT 
AAAGAAGACAGTCATAAGTGCGGCGACGATAGTCATGCCCCGCGCCCACCGGAAGGAGCTGACTGGGTTGA
AGGCTCTCAAGGGCATCGGTCGACGCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGG 
TTGAGGCCGTTGAGCACCGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGC 
CACGGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGATCTTCCC 
CATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCCGGTGATGCCGGCCACGATGCGT 

CCGGCGTAGAG 
FIG. 17
ATCTACTGAG ATTTCTGGCT
CAAGCATCCA AGCCATACTT 
CCAGTGAGAA ATTTAGATAA
TCTTATGCAG CTAGAAATTC 
TAACTTCAAT GTCAAAGTTG 
TCTATGAAGC GTTTTGCGGC 
CTTATGATGG TTTTTTGGAA 
ATTGCTGCAG ATTTCTCCAT 
CTTTTTTCCA ACAGTGGTTT 
TGCAGTACAT TGTAATGCAA 
CTAACCTACC AGGTTCACCT 
ATATTATAGA TTTTCTGATA
GGAGCCATCT GCGCACATAA 
CTTTAATTTC ACCGTTAGCC 
TTGATCCGAC ATACGATGAG
TATAGACCTA CCAAAAATCC 
CCTGAAACAA TAACCAGAAA
AGATCACAAA CCTACTATGA 
AAAATACAAG TGACGCTTTT 
ACAAGGCAAA TGTGGATCTG 
ACAAAAAAAG ACAAAAAAAA 
CTCAAGAAAC GTATTGTTGA 
GACCTATAAT GGTTTAAAAT 
CAGTCTTTCT CATACTCGAG 
GGCTAAACCG GCAAGTGCCT
GGCGGTCAGA AAAGAGCCTG 
ATGATGGGAA AGCGGGGATT 
GAGAAATACT GGTGGAAGTA
AAACCCAAAG ATGATCGATG
CGCTCGTGGG AACCCCACGC 
AACTCCGGTT TACGTTATAC 
GGAGGAGGGG GGAGGGGAAG 
TGAAGTGGGT TTTGCGTGGA 
CATATACGTA CATACTAGTG 
ACACTGGTAA AATACAGGAT 
TTGCTGAACA GCCGAGGTCA 
TTTCTGTGCT AGCTATGTCT 
AGCTTCATGT GCTTGCATGT 
AAGCCGCGCT AGGAGAAAAA 
AGAAAGCGTG AAATTGTTGC 
AATGCGTCTA CTGCACCTGC 
ATCTTGAAAA TACACAAGTG
GTGGTGCCAT GTATGCTATT 
TACTGCAATA ATAAAAAGTA 
CGCGCAAAAA AAAAAATAAA 
AAATAATAGA AGATCTAACT 
CGTAATAGCC GATACAAAAT 
AAAGCTGCTA CGAAAGTATA 
TTTCAAGGTA CCATATATAT 
AGGTGAAGAA TTATTCACTG 
GTGATGTTAA TGGTCACAAA 



CTTTTGTTGT ACTGTCACCT 
TTTACAGCAG GAGTTACAAG 
AACACCATTT CCTGCGAGTA
TCAATTGAGC ATCAAGATAA
AAATATTCTC CTTTAGAGCG 
AAACTCACCT TCAACTGTCA 
TTATTATTAT CCTACCATCA
CTCACTTTAT ATTTGGTGGC 
GGTAGGGACC CTGACTGACA 
GACGCTGATA AACTGTTCTA 
TCAAAAGCTC TGTGTTTGGT 
GCCCTGTGTG ACATTTATGA 
CGTAAGAGTT AGCCGTGACG 
ATCAGAATAG TCGTGTTTTC 
CTCAATGATT TAGATTATGT 
AGTGCGTACA CTAATACTTT 
GATCGGCAAA AAAATTTTTT
CGAAAAAGCT TGAAGTTTAG 
ATATGGTGCA AGGAACAAAA 
TCATGTATGG CAACGACAGC 
CTAAGGCAAA AGAACAAAGC 
AAAACCACCG TCGTAAGAAA 
CGGCCCATTT TTTTTCCCTC 
GGAAATTCGA CACAAACAGC 
GCAAGATCCA CAGAACTAAC 
TTCCGGAAAG AGAGAAACAG 
CGGCGAAGAA CGAGACTGGA 
TTCGGACCTT TGGCGAAGTC 
ATTCATTTTT CAATGCGCTA 
AAAACATATT ATTCGCTTCT 
CGTATTAGGA TCACTATAAG
AATGTACATC GTCATAAGGC 
AAATTCGTTT TCAATGATAT
GCCAAAAGCG TGGGGTGGGC 
TCTATGAACA ATAACAACAA 
GCCGATGCAA CCGAGGTTTC 
GTAGGTTTAC ATTTAATGGT 
GATGTCCTGC AGATGGTAAG 
TATTCTGCTC GAAGATCTGT 
GTTCTTGCAT TACTACTCAA 
ATGATAAAGC TTATGTATCA
GTGCAAAGAT GTGTCACGTT 
TAACATGCAA AGGGGAAGAC 
TTTTACGCGT TACCCAATAT 
AAACAATTAC AAACAAAAAG 
GAAGCGAAGG CCAAAACTCT
CTAGAGCAGC AACTTTTCTC 
GAAAAATCAA ACGCTCAGAA 
TTCCTTATAA CTGATGTTAA 
GTGTTGTCCC AATTTTGGTT 
TTTTCTGTCT CCGGTGAAGG 
FIG. 17 continued

2551 TGAAGGTGAT GCTACTTACG GTAAATTGAC CTTAAAATTT ATTTGTACTA
2601 CTGGTAAATT GCCAGTTCCA TGGCCAACCT TAGTCACTAC TTTCGGTTAT
2651 GGTGTTCAAT GTTTTGCGAG ATACCCAGAT CATATGAAAC AACATGACTT
2701 TTTCAAGTCT GCCATGCCAG AAGGTTATGT TCAAGAAAGA ACTATTTTTT
2751 TCAAAGATGA CGGTAACTAC AAGACCAGAG CTGAAGTCAA GTTTGAAGGT
2801 GATACCTTAG TTAATAGAAT CGAATTAAAA GGTATTGATT TTAAAGAAGA
2851 TGGTAACATT TTAGGTCACA AATTGGAATA CAACTATAAC TCTCACAATG
2901 TTTACATCAT GGCTGACAAA CAAAAGAATG GTATCAAAGT TAACTTCAAA
2951 ATTAGACACA ACATTGAAGA TGGTTCTGTT CAATTAGCTG ACCATTATCA
3001 ACAAAATACT CCAATTGGTG ATGGTCCAGT CTTGTTACCA GACAACCATT
3051 ACTTATCCAC TCAATCTGCC TTATCCAAAG ATCCAAACGA AAAGAGAGAC
3101 CACATGGTCT TGTTAGAATT TGTTACTGCT GCTGGTATTA CCCATGGTAT
3151 GGATGAATTG TACAAATAAC TGCAGGGCGC GCCACTTCTA AATAAGCGAA
3201 TTTCTTATGA TTTATGATTT TTATTATTAA ATAAGTTATA AAAAAAATAA
3251 GTGTATACAA ATTTTAAAGT GACTCTTAGG TTTTAAAACG AAAATTCTTA
3301 TTCTTGAGTA ACTCTTTCCT GTAGGTCAGG TTGCTTTCTC AGGTATAGTA
3351 TGAGGTCGCT CTTATTGACC ACACCTCTAC CGGCAGATCC GCTAGGGATA
3401 ACAGGGTAAT ATAGATCTGC CCGCCGGGAA GGCGAACCCG ATCGGATGCA
3451 TCCTCTCTGC TGCCATGATG CTGAAGTTGT CGTTGAACAT GGTTGCTGCC
3501 GGCGAGGCGG TCGAGCAGGC AGTGCAGGAG GTGTTGGACT CGGGAGTCAG
3551 AACGGGCGAC CTGCTCGGCT CGAGCTCCAC TTCGGAGGTT GGCGACGCCA
3601 TTGCGCTTGC AGTTAAGGAA GCCTTGCGCA GGCAATCCGC AGCTGGTCTG
3651 AGCTAGCCTC GAGGACCCTT CTCTTTAGAC TATTCTACTC TTATGCACGT
3701 AAAAAATTCT AGGAAATATG TATTAACTAG GAGTAAAATA ACCGGCTAGT
3751 GGCATTCATA TAGCCGTCTG TTTACATCTA CATCACACAT TTCGAGTGTA
3801 TATCTCGCAA CGTTGGCGTT AAATAGGCAG TCAATGGCCC GACCATTCTA
3851 TGGTGTTTAG GTCGATGCCA TCTTTGTACG TTTAGCTTAT CGATGATAAG
3901 CTGTCAAACA TGAGAATTCT TGAAGACGAA AGGGCCTCGT GATACGCCTA
3951 TTTTTATAGG TTAATGTCAT GATAATAATG GTTTCTTAGA CGTCAGGTGG
4001 CACTTTTCGG GGAAATGTGC GCGGAACCCC TATTTGTTTA TTTTTCTAAA
4051 TACATTCAAA TATGTATCCG CTCATGAGAC AATAACCCTG ATAAATGCTT
4101 CAATAATATT GAAAAAGGAA GAGTATGAGT ATTCAACATT TCCGTGTCGC
4151 CCTTATTCCC TTTTTTGCGG CATTTTGCCT TCCTGTTTTT GCTCACCCAG
4201 AAACGCTGGT GAAAGTAAAA GATGCTGAAG ATCAGTTGGG TGCACGAGTG
4251 GGTTACATCG AACTGGATCT CAACAGCGGT AAGATCCTTG AGAGTTTTCG
4301 CCCCGAAGAA CGTTTTCCAA TGATGAGCAC TTTTAAAGTT CTGCTATGTG
4351 GCGCGGTATT ATCCCGTGTT GACGCCGGGC AAGAGCAACT CGGTCGCCGC
4401 ATACACTATT CTCAGAATGA CTTGGTTGAG TACTCACCAG TCACAGAAAA
4451 GCATCTTACG GATGGCATGA CAGTAAGAGA ATTATGCAGT GCTGCCATAA
4501 CCATGAGTGA TAACACTGCG GCCAACTTAC TTCTGACAAC GATCGGAGGA
4551 CCGAAGGAGC TAACCGCTTT TTTGCACAAC ATGGGGGATC ATGTAACTCG
4601 CCTTGATCGT TGGGAACCGG AGCTGAATGA AGCCATACCA AACGACGAGC
4651 GTGACACCAC GATGCCTGCA GCAATGGCAA CAACGTTGCG CAAACTATTA
4701 ACTGGCGAAC TACTTACTCT AGCTTCCCGG CAACAATTAA TAGACTGGAT
4751 GGAGGCGGAT AAAGTTGCAG GACCACTTCT GCGCTCGGCC CTTCCGGCTG
4801 GCTGGTTTAT TGCTGATAAA TCTGGAGCCG GTGAGCGTGG GTCTCGCGGT
4851 ATCATTGCAG CACTGGGGCC AGATGGTAAG CCCTCCCGTA TCGTAGTTAT
4901 CTACACGACG GGGAGTCAGG CAACTATGGA TGAACGAAAT AGACAGATCG
4951 CTGAGATAGG TGCCTCACTG ATTAAGCATT GGTAACTGTC AGACCAAGTT
5001 TACTCATATA TACTTTAGAT TGATTTAAAA CTTCATTTTT AATTTAAAAG
5051 GATCTAGGTG AAGATCCTTT TTGATAATCT CATGACCAAA ATCCCTTAAC
5101 GTGAGTTTTC GTTCCACTGA GCGTCAGACC CCGTAGAAAA GATCAAAGGA
5151 TCTTCTTGAG ATCCTTTTTT TCTGCGCGTA ATCTGCTGCT TGCAAACAAA
5201 AAAACCACCG CTACCAGCGG TGGTTTGTTT GCCGGATCAA GAGCTACCAA
FIG. 17 continued
7201 TGAAGAATCA
7251 ACATCGGTAT
7601
7651
7901
GAAGGTAACT GGCTTCAGCA
TGTAGCCGTA GTTAGGCCAC 
TACCTCGCTC TGCTAATCCT 
GTCGTGTCTT ACCGGGTTGG 
AGCGGTCGGG CTGAACGGGG 
ACGACCTACA CCGAACTGAG 
CACGCTTCCC GAAGGGAGAA 
TCGGAACAGG AGAGCGCACG 
CTTTATAGTC CTGTCGGGTT 
GTGATGCTCG TCAGGGGGGC 
CCTTTTTACG GTTCCTGGCC 
CCTGCGTTAT CCCCTGATTC 
AGCTGATACC GCTCGCCGCA 
GCGAGGAAGC GGAAGAGCGC 
TGCGGTATTT CACACCGCAT 
TGCCGCATAG TTAAGCCAGT 
CATGGCTGCG CCCCGACACC 
CTTGTCTGCT CCCGGCATCC 
AGCTGCATGT GTCAGAGGTT 
GAGCTTTGAA GAAAAATGCG 
CCAAAATCTC ACATTGGAAG 
GGGCCTAACG GAGTTGACTA 
CTTCTGCCGT GGCCAGGACA 
GATCACTACT TCGCACTAGT
AAAGGAAATG ATAGCATTGA 
AGCATATAGA ACAGCTAAAG 
GCATGGAATG GGATAATATC 
CATAAATAGA CGCATATAAG 
CGTTCTTCTC ATGTATATAT 
ACGTGAACAG TGAGCTGTAT
GCTCGTTTTC GGAAACGCTT 
TAGAAAGTAT AGGAACTTCA
AGACGCACTT TCAAAAAACC 
AAATATTGCG AATACCGCTT 
CTATATATCT CTGTGCTATA 
CTCCTTGAAC TTGCATCTAA 
CTAGTATTAC TCTTTAGACA 
TGAATCGAAA ACAATACGAA 
GAGACAAAAT AGAAGAAACC 
TCAACGCTAT CACTTTCTGT 
AGAATATAAT CGGGGATGCC 
GCTAGTAATC AGTAAACGCG 
AGAAAATAGA CACCAAAGTA 
GCAAAAAGTT ATCAAGAGAC 
AGTAATCTAA GATGCTTTGT 
TGTAGAACAA AAAAGAAGTA
GCGTTGCATT TCTGTTCTGT 
AATTAGCGCT CTCGCGTTGC 
TCTTCGTTGG TAAAATAGCG 
TGCAGCTCAG ATTCTTTGTT 
TGTTCTACAA AATGAAGCAC 
GCGTGGTCGT GAAGCGATTC 
CTCGTTGAGT TTCTCCAGAA 
CCATGTTAAG GGCGGTTTTT 



GAGCGCAGAT ACCAAATACT 
CACTTCAAGA ACTCTGTAGC 
GTTACCAGTG GCTGCTGCCA 
ACTCAAGACG ATAGTTACCG 
GGTTCGTGCA CACAGCCCAG 
ATACCTACAG CGTGAGCTAT 
AGGCGGACAG GTATCCGGTA 
AGGGAGCTTC CAGGGGGAAA 
TCGCCACCTC TGACTTGAGC 
GGAGCCTATG GAAAAACGCC 
TTTTGCTGGC CTTTTGCTCA 
TGTGGATAAC CGTATTACCG 
GCCGAACGAC CGAGCGCAGC 
CTGATGCGGT ATTTTCTCCT 
ATGGTGCACT CTCAGTACAA 
ATACACTCCG CTATCGCTAC 
CGCCAACACC CGCTGACGCG 
GCTTACAGAC AAGCTGTGAC 
TTCACCGTCA TCACCGAAAC 
CCTTATTCAA TCTTTGCTAT 
ACATTTGATG ACCTCATTTC 
ATGTTGTGGG AAATTGGAGC 
ACGTATACTC ATCAGATAAC
TTCTCGGTAC TATGCATATG 
AGGATGAGAC TAATCCAATT 
GGTAGTGCTG AAGGAAGCAT 
ACAGGAGGTA CTAGACTACC 
TACGCATTTA AGCATAAACA 
ATATACAGGC AACACGCAGA
GTGCGCAGCT CGCGTTGCAT 
TGAAGTTCCT ATTCCGAAGT 
GAGCGCTTTT GAAAACCAAA 
AAAAACGCAC CGGACTGTAA 
CCACAAACAT TGCTCAAAAG
TCCCTATATA ACCTACCCAT 
ACTCGACCTC TACATTTTTT 
AAAAAATTGT AGTAAGAACT 
AATGTAAACA TTTCCTATAC 
GTTCATAATT TTCTGACCAA 
TCACAAAGTA TGCGCAATCC 
TTTATCTTGA AAAAATGCAC
GGAAGTGGAG TCAGGCTTTT 
GCCTTCTTCT AACCTTAACG 
TGCATTATAG AGCGCACAAA 
TAGAAAAATA GCGCTCTCGG 
TAGATTCTTT GTTGGTAAAA 
AAAAATGCAG CTCAGATTCT
ATTTTTGTTT TACAAAAATG
CTTTCGCGTT GCATTTCTGT 
TGAAAAATTA GCGCTCTCGG
AGATGCTTCG TTCTGCGGTA 
ACAGATGTCT GCCTGTTCAT 
GCGTTAATGT CTGGCTTCTG 
TCCTGTTTGG TCACTGATGC 
FIG. 17 continued

7951 CTCCGTGTAA GGGGGATTTC TGTTCATGGG GGTAATGATA CCGATGAAAC
8001 GAGAGAGGAT GCTCACGATA CGGGTTACTG ATGATGAACA TGCCCGGTTA
8051 CTGGAACGTT GTGAGGGTAA ACAACTGGCG GTATGGATGC GGCGGGACCA
8101 GAGAAAAATC ACTCAGGGTC AATGCCAGCG CTTCGTTAAT ACAGATGTAG
8151 GTGTTCCACA GGGTAGCCAG CAGCATCCTG CGATGCAGAT CCGGAACATA
8201 ATGGTGCAGG GCGCTGACTT CCGCGTTTCC AGACTTTACG AAACACGGAA
8251 ACCGAAGACC ATTCATGTTG TTGCTCAGGT CGCAGACGTT TTGCAGCAGC
8301 AGTCGCTTCA CGTTCGCTCG CGTATCGGTG ATTCATTCTG CTAACCAGTA
8351 AGGCAACCCC GCCAGCCTAG CCGGGTCCTC AACGACAGGA GCACGATCAT
8401 GCGCACCCGT GGCCAGGACC CAACGCTGCG GGGGGGGGGG GGGTTTTCTT
8451 TCCAATTTTT TTTTTTTCGT CATTATAGAA ATCATTACGA CCGAGATTCC
8501 CGGGTAATAA CTGATATAAT TAAATTGAAG CTCTAATTTG TGAGTTTAGT
8551 ATACATGCAT TTACTTATAA TACAGTTTTT TAGTTTTGCT GGCCGCATCT
8601 TCTCAAATAT GCTTCCCAGC CTGCTTTTCT GTAACGTTCA CCCTCTACCT
8651 TAGCATCCCT TCCCTTTGCA AATAGTCCTC TTCCAACAAT AATAATGTCA
8701 GATCCTGTAG AGACCACATC ATCCACGGTT CTATACTGTT GACCCAATGC
8751 GTCTCCCTTG TCATCTAAAC CCACACCGGG TGTCATAATC AACCAATCGT
8801 AACCTTCATC TCTTCCACCC ATGTCTCTTT GAGCAATAAA GCCGATAACA
8851 AAATCTTTGT CGCTCTTCGC AATGTCAACA GTACCCTTAG TATATTCTCC
8901 AGTAGCTAGG GAGCCCTTGC ATGACAATTC TGCTAACATC AAAAGGCCTC
8951 TAGGTTCCTT TGTTACTTCT TCCGCCGCCT GCTTCAAACC GCTAACAATA
9001 CCTGGGCCCA CCACACCGTG TGCATTCGTA ATGTCTGCCC ATTCTGCTAT
9051 TCTGTATACA CCCGCAGAGT ACTGCAATTT GACTGTATTA CCAATGTCAG
9101 CAAATTTTCT GTCTTCGAAG AGTAAAAAAT TGTACTTGGC GGATAATGCC
9151 TTTAGCGGCT TAACTGTGCC CTCCATGGAA AAATCAGTCA AGATATCCAC
9201 ATGTGTTTTT AGTAAACAAA TTTTGGGACC TAATGCTTCA ACTAACTCCA
9251 GTAATTCCTT GGTGGTACGA ACATCCAATG AAGCACACAA GTTTGTTTGC
9301 TTTTCGTGCA TGATATTAAA TAGCTTGGCA GCAACAGGAC TAGGATGAGT
9351 AGCAGCACGT TCCTTATATG TAGCTTTCGA CATGATTTAT CTTCGTTTCC
9401 TGCAGGTTTT TGTTCTGTGC AGTTGGGTTA AGAATACTGG GCAATTTCAT
9451 GTTTCTTCAA CACCACATAT GCGTATATAT ACCAATCTAA GTCTGTGCTC
9501 CTTCCTTCGT TCTTCCTTCT GCTCGGAGAT TACCGAATCA AAAAAATTTC
9551 AAAGAAACCG GAATCAAAAA AAAGAACAAA AAAAAAAAAG ATGAATTGAA
9601 ACCCCCCCCC CCCCCGATGC GCCGCGTGCG GCTGCTGGAG ATGGCGGACG
9651 CGATGGATAT GTTCTGCCAA GGGTTGGTTT GCGCATTCAC AGTTCTCCGC
9701 AAGAATTGAT TGGCTCCAAT TCTTGGAGTG GTGAATCCGT TAGCGAGGTG
9751 CCGCCGGCTT CCATTCAGGT CGAGGTGGCC CGGCTCCATG CACCGCGACG
9801 CAACGCGGGG AGGCAGACAA GGTATAGGGC GGCGCCTACA ATCCATGCCA
9851 ACCCGTTCCA TGTGCTCGCC GAGGCGGCAT AAATCGCCGT GACGATCAGC
9901 GGTCCAGTGA TCGAAGTTAG GCTGGTAAGA GCCGCGAGCG ATCCTTGAAG
9951 CTGTCCCTGA TGGTCGTCAT CTACCTGCCT GGACAGCATG GCCTGCAACG
10001 CGGGCATCCC GATGCCGCCG GAAGCGAGAA GAATCATAAT GGGGAAGGCC
10051 ATCCAGCCTC GCGTCGCGAA CGCCAGCAAG ACGTAGCCCA GCGCGTCGGC
10101 CGCCATGCCG GCGATAATGG CCTGCTTCTC GCCGAAACGT TTGGTGGCGG
10151 GACCAGTGAC GAAGGCTTGA GCGAGGGCGT GCAAGATTCC GAATACCGCA
10201 AGCGACAGGC CGATCATCGT CGCGCTCCAG CGAAAGCGGT CCTCGCCGAA
10251 AATGACCCAG AGCGCTGCCG GCACCTGTCC TACGAGTTGC ATGATAAAGA
10301 AGACAGTCAT AAGTGCGGCG ACGATAGTCA TGCCCCGCGC CCACCGGAAG
10351 GAGCTGACTG GGTTGAAGGC TCTCAAGGGC ATCGGTCGAC GCTCTCCCTT
10401 ATGCGACTCC TGCATTAGGA AGCAGCCCAG TAGTAGGTTG AGGCCGTTGA
10451 GCACCGCCGC CGCAAGGAAT GGTGCATGCA AGGAGATGGC GCCCAACAGT
10501 CCCCCGGCCA CGGGGCCTGC CACCATACCC ACGCCGAAAC AAGCGCTCAT
10551 GAGCCCGAAG TGGCGAGCCC GATCTTCCCC ATCGGTGATG TCGGCGATAT
10601 AGGCGCCAGC AACCGCACCT GTGGCGCCGG TGATGCCGGC CACGATGCGT
10651 CCGGCGTAGA G
FIG. 18

Cytotoxicity Profile: 
FIG. 19

Cytotoxicity Profile: 
-


4* 


- 


Hydrazine monohydrate


4<4- 






Nitrofurantoin 


- 


+ 




Hydroxyurea 


++ 


+ 




N-Nitrosodimethylamine 


- 


4- 




Methyl methanesulfonate 


4-4- 


+ 


- 


4,4-Oxydianiline


- 


4- 


- 


MNNG 


4-4 






Quercetin 




4- 


- 


Nalidixic Acid 


+4 






Sodium Selenite 


- 


+ 


- 


4-Nitroquinoline-N-oxide 


4-4- 






Acetylsalicylic Acid


- 


- 




N-Nitroso-N-ethyl urea 


4-4 






Actinomycin D 


- 


- 




N-Nitroso-N-methyl urea


4-+ 






4-Aminophenol


- 


- 




Sodium Azide 


+* 






Ampicillin (Na salt)


- 


- 




Streptonigrin 


4-+ 


+ 


* 


Aniline 


- 






Trichloroacetonitrile


** 


4- 




AraC 


- 


- 




Benzo(a)pyrene 


4* 


4- 


* 


AZT 


- 


- 




1-Naphthylamine 


t 


4- 




Cadmium Chloride 




•* 




Benzoyl Chloride 


4* 


4- 




Caffeine 


• 






Cisplatin (without DMSO) 




+ 




Chromomycin A3 


•* 






1,2-Epoxybutane 




4* 




Cycloheximide








Hexamethylenetetramine


4- 


4- 




3,5-Dichlorophenol








Hydrogen Peroxide 


+ 


4* * 


** 


Dicumyl Peroxide 








Mechlorethamine HCl


4~ 


4* 




Dieldrin 




** 




Mitomycin C 


4* 


+• 




Diethylamino-4-methylcoumarin


■ 






3-Amino-1,2,4-triazole


+ 






Ethyl Acrylate








Aphidicolin 


4* 






Ethylenediamine 








Benzaldehyde 


+ 


I ~~ 




Methyl Carbamate 


** 






Colchicine 


4« 






Methyl Methacrylate 








Etoposide 


4* 






Nicotine 




** 




Methyl viologen 


4- 






Nitrobenzene 








Psoralen 


+ 






Phenol 








Catechol 


4-4- 


** 




Sulfisoxazole 








Chloramphenicol 


+4- 


" 




Taxol 








1,2-Dimethylhydrazine HCl


4-4- 






Tetracycline HCl








Econazole Nitrate 


4-4- 






Titanium Dioxide 








Methapyrilene HCl


+4- 






Vanillin








Phthalic acid bis(2-ethylhexyl) ester 








Vinblastine 








Safrole 


++ 






S9 column records requirement for Ames result:
+ S9 required
- S9 not required

Table 3


Sulfamethoxazole 


+4* 






Thiourea 


4*4« 






Tritolyl Phosphate


4-4- 






Urethane 


4-4- 
FIG. 22
FIG. 23



Microplate layout
FIG. 24



Restriction sites labelled: BamHI, EcoRI, AatII



WO 2005/012533 



PCT/GB2004/003337 


FIG. 26 

pGenRNR2 

GGATCCGTACCTTCCAGCATTGTCCTTCTGAGAAAACAAAAATGGAAGATGTTGTGAAAATGCAGTAAGTGA
CAATAAGCTAGCGACTTTTCAATGATCGCAGATTAAGTATATATACACTTTAGGTCTCCTTCTCTTCTTTAC 
TAGCTACGAAAAAACAAAGAAAAAAAAAACAAAATAGAGGGAAAAAAGA 

GGGGAGGTTCGAACTCTCGACCTTCAGATTATGAGACTGACGCTCTTCCTACTGAGCTACTGAAGCACTATT 
TCTTGTTCCAACCGAGGAATAATACTAGACCTGTAGAAAAGTACCGCTTATCTAATATCCAACTTTTTCTTC 
TGTTCCCTTTAGTGAATTCAAGATGAAGGAACATAACAGAACGGAAAACAAAAAAAAAGGGAAGAAAAAAGA 
TATCACCACCAATTGGCACGCTCGCAACGGTTTCGGGCCAGATTCATGGCCCTGTGGGGTAGAAGTGCCTAC 
AGACAAAAAACCGTTACGTCCCGCCTCACCAAGCATCGACACACAAGCCCTCTCAACCTCCTTAATTTCTTT 
ATCTTTTTTTCCCTTCACCTCTGTGCTCTTTTTTTAGCCACAGCTGTGCATTACGCTTGCTTTGCTTGTTTT 
GTTCTCTTTCGCCATGCCATATTACTACTTTTCTACTTACTGGCACGAGGCCGCGCAAGCCAGATCCAAGCA 
CGCCAGAAAGTGCGGTGTACTGGTGCAACCTTTTATCGATTTCTCATCATATCGCTATCGCAACGGGCATGG
GCGACGAAAAAGCCAATCTAAAGCGTTGTCTTCCTCGTTCCTCGCGCACGCGATTTTGCGTTCTGCATAGGA 
AGCCGAAGTCGAACAAGAAGCAGGCAAAGTTTAGAGCACTGCCCCTCCGCACTCAAAAAAGAAAAAACTAGG 
AGGAAAATAAAATTCTCAACCACACAAACACATAAACACATACAAATACAAATACAAGCTTATTTACTTGAC 
ATCGCGCGATCTTCCACTATTCAGCGCCGTCCGCCCTCTCTCGTGTTTTTTGTTTACGCGACAACTATGCGA 
AATCCGGAGCAACGGGCAACCGTTTGGGGAAAGACCACACCCACGCGCGATCGCCATGGCAACGAGGTCGCA 
CACGCCCCACACCCAGACCTCCCTGCGAGCGGGCATGGGTACAATGTCCCCGTTGCCACAGAGACCACTTCG 
TAGCACAGCGCAGAGCGTAGCGTGTTGTTGCTGCTGACAAAAGAAAATTTTTCTTAGCAAAGCAAAGGAGGG 
GAAGCACGGGCAGATAGCACCGTACCATACCCTTGGAAACTCGAAATGAACGAAGCAGGAAATGAGAGAATG 
AGAGTTTTGTAGGTATATATAGCGGTAGTGTTTGCGCGTTACCATCATCTTCTGGATCTATCTATTGTTCTT 
TTCCTCATCACTTTCCCCTTTTTCGCTCTTCTTCTTGTCTTTTATTTCTTTCTTTTTTTTAATTGTTCCCTC 
GATTGGCTATCTACCAAAGAATCCAAACTTAATACACGTATTTATTTGTCCAATTACCATGTTAATTAACTC
TAAAGGTGAAGAATTATTCACTGGTGTTGTCCCAATTTTGGTTGAATTAGATGGTGATGTTAATGGTCACAA 
ATTTTCTGTCTCCGGTGAAGGTGAAGGTGATGCTACTTACGGTAAATTGACCTTAAAATTTATTTGTACTAC 
TGGTAAATTGCCAGTTCCATGGCCAACCTTAGTCACTACTTTCGGTTATGGTGTTCAATGTTTTGCGAGATA 
CCCAGATCATATGAAACAACATGACTTTTTCAAGTCTGCCATGCCAGAAGGTTATGTTCAAGAAAGAACTAT
TTTTTTCAAAGATGACGGTAACTACAAGACCAGAGCTGAAGTCAAGTTTGAAGGTGATACCTTAGTTAATAG 
AATCGAATTAAAAGGTATTGATTTTAAAGAAGATGGTAACATTTTAGGTCACAAATTGGAATACAACTATAA 
CTCTCACAATGTTTACATCATGGCTGACAAACAAAAGAATGGTATCAAAGTTAACTTCAAAATTAGACACAA
CATTGAAGATGGTTCTGTTCAATTAGCTGACCATTATCAACAAAATACTCCAATTGGTGATGGTCCAGTCTT
GTTACCAGACAACCATTACTTATCCACTCAATCTGCCTTATCCAAAGATCCAAACGAAAAGAGAGACCACAT 
GGTCTTGTTAGAATTTGTTACTGCTGCTGGTATTACCCATGGTATGGATGAATTGTACAAATAACTGCAGGG 
CGCGCCACTTCTAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAAAATA 
AGTGTATACAAATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTTATTCTTGAGTAACTCTTTCCTG 
TAGGTCAGGTTGCTTTCTCAGGTATAGTATGAGGTCGCTCTTATTGACCACACCTCTACCGGCAGATCCGCT 
AGGGATAACAGGGTAATATAGATCTGCCCGCCGGGAAGGCGAACCCGATCGGATGCATCCTCTCTGCTGCCA 
TGATGCTGAAGTTGTCGTTGAACATGGTTGCTGCCGGCGAGGCGGTCGAGCAGGCAGTGCAGGAGGTGTTGG 
ACTCGGGAGTCAGAACGGGCGACCTGCTCGGCTCGAGCTCCACTTCGGAGGTTGGCGACGCCATTGCGCTTG 
CAGTTAAGGAAGCCTTGCGCAGGCAATCCGCAGCTGGTCTGAGCTAGCCTCGAGGACCCTTCTCTTTAGACT 
ATTCTACTCTTATGCACGTAAAAAATTCTAGGAAATATGTATTAACTAGGAGTAAAATAACCGGCTAGTGGC 
ATTCATATAGCCGTCTGTTTACATCTACATCACACATTTCGAGTGTATATCTCGCAACGTTGGCGTTAAATA 
GGCAGTCAATGGCCCGACCATTCTATGGTGTTTAGGTCGATGCCATCTTTGTACAGCTTGCCTCGTCCCCGC 
CGGGTCACCCGGCCAGCGACATGGAGGCCCAGAATACCCTCCTTGACAGTCTTGACGTGCGCAGCTCAGGGG 
CATGATGTGACTGTCGCCCGTACATTTAGAGCAAAAATTACGGCTCCTCGCTGCAGACCTGCGAGCAGGGAA 
ACGCTCCCCTCACAGACGCGTTGAATTGTCCCCACGCCGCGCCCCTGTAGAGAAATATAAAAGGTTAGGATT 
TGCCACTGAGGTTCTTCTTTCATATACTTCCTTTTAAAATCTTGCTAGGATACAGTTCTCACATCACATCCG 
AACATAAACAACCATGGGTAAGGAAAAGACTCACGTTTCGAGGCCGCGATTAAATTCCAACATGGATGCTGA 
TTTATATGGGTATAAATGGGCTCGCGATAATGTCGGGCAATCAGGTGCGACAATCTATCGATTGTATGGGAA 

GCCCGATGCGCCAGAGTTGTTTC 
TGAAACATGGCAAAGGTAGCGTTGCCAATGATGTTACAGATGAGATGGTCAGACTAAACTGGCTGACGGAAT

TTATGCCTCTTCCGACCATCAAGCATTTTATCCGTACTCCTGATGATGCATGGTTACTCACCACTGCGATCC 

CCGGCAAAACAGCATTCCAGGTATTAGAAGAATATCCTGATTCAGGTGAAAATATTGTTGATGCGCTGGCAG 

TGTTCCTGCGCCGGTTGCATTCGATTCCTGTTTGTAATTGTCCTTTTAACAGCGATCGCGTATTTCGTCTCG 

CTCAGGCGCAATCACGAATGAATAACGGTTTGGTTGATGCGAGTGATTTTGATGACGAGCGTAATGGCTGGC 

CTGTTGAACAAGTCTGGAAAGAAATGCATAAGCTTTTGCCATTCTCACCGGATTCAGTCGTCACTCATGGTG 

ATTTCTCACTTGATAACCTTATTTTTGACGAGGGGAAATTAATAGGTTGTATTGATGTTGGACGAGTCGGAA 

TCGCAGACCGATACCAGGATCTTGCCATCCTATGGAACTGCCTCGGTGAGTTTTCTCCTTCATTACAGAAAC 

GGCTTTTTCAAAAATATGGTATTGATAATCCTGATATGAATAAATTGCAGTTTCATTTGATGCTCGATGAGT 

TTTTCTAATCAGTACTGACAATAAAAAGATTCTTGTTTTCAAGAACTTGTCATTTGTATAGTTTTTTTATAT 

TGTAGTTGTTCTATTTTAATCAAATGTTAGCGTGATTTATATTTTTTTTCGCCTCGACATCATCTGCCCAGA 

TGCGAAGTTAAGTGCGCAGAAAGTAATATCATGCGTCAATCGTATGTGAATGCTGGTCGCTATACTGCTGTC 

GATTCGATACTAACGCCGCCATCCAGTGTCGACTAGGGTTGCTGCCATCGGCCTCGCTCGCGTCTTTGCCGG 

ATAGCAAGAGCGCCTTTGGCCTCTACGAGCCCTGCCACGGCTCTGCGCCCGATCTGCCCGCCGGGAAGGCGA 

ACCCGATCGGATGCATCCTCTCTGCTGCCATGATGCTGAAGTTGTCGTTGAACATGGTTGCTGCCGGCGAGG 

CGGTCGAGCAGGCAGTGCAGGAGGTGTTGGACTCGGGAGTCAGAACGGGCGACCTGCTCGGCTCGAGCTCCA 

CTTCGGAGGTTGGCGACGCCATTGCGCTTGCAGTTAAGGAAGCCTTGCGCAGGCAATCCGCAGCTGGTCTGA 

GCTAGCCTCGAGGACCCTTCTCTTTAGACTATTCTACTCTTATGCACGTAAAAAATTCTAGGAAATATGTAT 

TAACTAGGAGTAAAATAACCGGCTAGTGGCATTCATATAGCCGTCTGTTTACATCTACATCACACATTTCGA 

GTGTATATCTCGCAACGTTGGCGTTAAATAGGCAGTCAATGGCCCGACCATTCTATGGTGTTTAGGTCGATG 

CCATCTTTGTACGTTTAGCTTATCGATGATAAGCTGTCAAACATGAGAATTCTTGAAGACGAAAGGGCCTCG 

TGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGTCAGGTGGCACTTTTCGGG

GAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAAT 

AACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTA 

TTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTG 

AAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTC 

GCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTG 

ACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCA 

CAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACA 

CTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGG 

ATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCA 

CGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTCCCGGC 

AACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCT 

GGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGATG 

GTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGA 

TCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGA 

TTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAA 

TCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATC 

CTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGG 

ATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTC 

TAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCC 

TGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGG 

ATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCG 

AACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATC 

CGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATA 

GTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTAT 

GGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTC 

CTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTCGCCGCAGCC 

GAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTCTCCTTACGC 

ATCTGTGCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCC 

AGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACACCCGCTGACGC 

GCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTG

TCAGAGGTTTTCACCGTCA 
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TCACCGAAACGCGCGAGGCAGAGCTTTGAAGAAAAATGCGCCTTATTCAATCTTTGCTATAAAAAATGGCCC

AAAATCTCACATTGGAAGACATTTGATGACCTCATTTCTTTCAATGAAGGGCCTAACGGAGTTGACTAATGT 

TGTGGGAAATTGGAGCGATAAGCGTGCTTCTGCCGTGGCCAGGACAACGTATACTCATCAGATAACAGCAAT 

ACCTGATCACTACTTCGCACTAGTTTCTCGGTACTATGCATATGATCCAATATCAAAGGAAATGATAGCATT

GAAGGATGAGACTAATCCAATTGAGGAGTGGCAGCATATAGAACAGCTAAAGGGTAGTGCTGAAGGAAGCAT 

ACGATACCCCGCATGGAATGGGATAATATCACAGGAGGTACTAGACTACCTTTCATCCTACATAAATAGACG 

CATATAAGTACGCATTTAAGCATAAACACGCACTATGCCGTTCTTCTCATGTATATATATATACAGGCAACA

CGCAGATATAGGTGCGACGTGAACAGTGAGCTGTATGTGCGCAGCTCGCGTTGCATTTTCGGAAGCGCTCGT 

TTTCGGAAACGCTTTGAAGTTCCTATTCCGAAGTTCCTATTCTCTAGAAAGTATAGGAACTTCAGAGCGCTT 

TTGAAAACCAAAAGCGCTCTGAAGACGCACTTTCAAAAAACCAAAAACGCACCGGACTGTAACGAGCTACTA 

AAATATTGCGAATACCGCTTCCACAAACATTGCTCAAAAGTATCTCTTTGCTATATATCTCTGTGCTATATC 

CCTATATAACCTACCCATCCACCTTTCGCTCCTTGAACTTGCATCTAAACTCGACCTCTACATTTTTTATGT 

TTATCTCTAGTATTACTCTTTAGACAAAAAAATTGTAGTAAGAACTATTCATAGAGTGAATCGAAAACAATA 

CGAAAATGTAAACATTTCCTATACGTAGTATATAGAGACAAAATAGAAGAAACCGTTCATAATTTTCTGACC 

AATGAAGAATCATCAACGCTATCACTTTCTGTTCACAAAGTATGCGCAATCCACATCGGTATAGAATATAAT 

CGGGGATGCCTTTATCTTGAAAAAATGCACCCGCAGCTTCGCTAGTAATCAGTAAACGCGGGAAGTGGAGTC 

AGGCTTTTTTTATGGAAGAGAAAATAGACACCAAAGTAGCCTTCTTCTAACCTTAACGGACCTACAGTGCAA 

AAAGTTATCAAGAGACTGCATTATAGAGCGCACAAAGGAGAAAAAAAGTAATCTAAGATGCTTTGTTAGAAA 

AATAGCGCTCTCGGGATGCATTTTTGTAGAACAAAAAAGAAGTATAGATTCTTTGTTGGTAAAATAGCGCTC 

TCGCGTTGCATTTCTGTTCTGTAAAAATGCAGCTCAGATTCTTTGTTTGAAAAATTAGCGCTCTCGCGTTGC 

ATTTTTGTTTTACAAAAATGAAGCACAGATTCTTCGTTGGTAAAATAGCGCTTTCGCGTTGCATTTCTGTTC 

TGTAAAAATGCAGCTCAGATTCTTTGTTTGAAAAATTAGCGCTCTCGCGTTGCATTTTTGTTCTACAAAATG 

AAGCACAGATGCTTCGTTCTGCGGTAAAGCTCATCAGCGTGGTCGTGAAGCGATTCACAGATGTCTGCCTGT 

TCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGTCTGGCTTCTGATAAAGCGGGCCATGTTA 

AGGGCGGTTTTTTCCTGTTTGGTCACTGATGCCTCCGTGTAAGGGGGATTTCTGTTCATGGGGGTAATGATA 

CCGATGAAACGAGAGAGGATGCTCACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGGAACGTTGT 

GAGGGTAAACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAAAATCACTCAGGGTCAATGCCAGCGCTTC 

GTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATCCTGCGATGCAGATCCGGAACATAATGGTG 

CAGGGCGCTGACTTCCGCGTTTCCAGACTTTACGAAACACGGAAACCGAAGACCATTCATGTTGTTGCTCAG 

GTCGCAGACGTTTTGCAGCAGCAGTCGCTTCACGTTCGCTCGCGTATCGGTGATTCATTCTGCTAACCAGTA 

AGGCAACCCCGCCAGCCTAGCCGGGTCCTCAACGACAGGAGCACGATCATGCGCACCCGTGGCCAGGACCCA 

ACGCTGCGGGGGGGGGGGGGGTTTTCTTTCCAATTTTTTTTTTTTCGTCATTATAGAAATCATTACGACCGA 

GATTCCCGGGTAATAACTGATATAATTAAATTGAAGCTCTAATTTGTGAGTTTAGTATACATGCATTTACTT 

ATAATACAGTTTTTTAGTTTTGCTGGCCGCATCTTCTCAAATATGCTTCCCAGCCTGCTTTTCTGTAACGTT 

CACCCTCTACCTTAGCATCCCTTCCCTTTGCAAATAGTCCTCTTCCAACAATAATAATGTCAGATCCTGTAG 

AGACCACATCATCCACGGTTCTATACTGTTGACCCAATGCGTCTCCCTTGTCATCTAAACCCACACCGGGTG 

TCATAATCAACCAATCGTAACCTTCATCTCTTCCACCCATGTCTCTTTGAGCAATAAAGCCGATAACAAAAT

CTTTGTCGCTCTTCGCAATGTCAACAGTACCCTTAGTATATTCTCCAGTAGCTAGGGAGCCCTTGCATGACA 

ATTCTGCTAACATCAAAAGGCCTCTAGGTTCCTTTGTTACTTCTTCCGCCGCCTGCTTCAAACCGCTAACAA 

TACCTGGGCCCACCACACCGTGTGCATTCGTAATGTCTGCCCATTCTGCTATTCTGTATACACCCGCAGAGT 

ACTGCAATTTGACTGTATTACCAATGTCAGCAAATTTTCTGTCTTCGAAGAGTAAAAAATTGTACTTGGCGG 

ATAATGCCTTTAGCGGCTTAACTGTGCCCTCCATGGAAAAATCAGTCAAGATATCCACATGTGTTTTTAGTA 

AACAAATTTTGGGACCTAATGCTTCAACTAACTCCAGTAATTCCTTGGTGGTACGAACATCCAATGAAGCAC 

ACAAGTTTGTTTGCTTTTCGTGCATGATATTAAATAGCTTGGCAGCAACAGGACTAGGATGAGTAGCAGCAC 

GTTCCTTATATGTAGCTTTCGACATGATTTATCTTCGTTTCCTGCAGGTTTTTGTTCTGTGCAGTTGGGTTA 

AGAATACTGGGCAATTTCATGTTTCTTCAACACCACATATGCGTATATATACCAATCTAAGTCTGTGCTCCT 

TCCTTCGTTCTTCCTTCTGCTCGGAGATTACCGAATCAAAAAAATTTCAAAGAAACCGGAATCAAAAAAAAG 

AACAAAAAAAAAAAAGATGAATTGAAACCCCCCCCCCCCCCGATGCGCCGCGTGCGGCTGCTGGAGATGGCG 

GACGCGATGGATATGTTCTGCCAAGGGTTGGTTTGCGCATTCACAGTTCTCCGCAAGAATTGATTGGCTCCA 

ATTCTTGGAGTGGTGAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGGCCCGGCTCCATG 

CACCGCGACGCAACGCGGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATGCCAACCCGTTCCATG 

TGCTCGCCGAGGCGGCATAAATCGCCGTGACGATCAGCGGTCCAGTGATCGAAGTTAGGCTGGTAAGAGCCG 

CGAGCGATCCTTGAAGCTGTCCCTGATGGTCGTCATCTACCTGCCTGGACAGCATGGCCTGCAACGCGGGCA 

TCCCGATGCCGCCGGAAG 
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CGAGAAGAATCATAATGGGGAAGGCCATCCAGCCTCGCGTCGCGAACGCCAGCAAGACGTAGCCCAGCGCGT 
CGGCCGCCATGCCGGCGATAATGGCCTGCTTCTCGCCGAAACGTTTGGTGGCGGGACCAGTGACGAAGGCTT 
GAGCGAGGGCGTGCAAGATTCCGAATACCGCAAGCGACAGGCCGATCATCGTCGCGCTCCAGCGAAAGCGGT 
CCTCGCCGAAAATGACCCAGAGCGCTGCCGGCACCTGTCCTACGAGTTGCATGATAAAGAAGACAGTCATAA 
GTGCGGCGACGATAGTCATGCCCCGCGCCCACCGGAAGGAGCTGACTGGGTTGAAGGCTCTCAAGGGCATCG 
GTCGACGCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACCG 
CCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGTCCCCCGGCCACGGGGCCTGCCACCATAC 
CCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGATCTTCCCCATCGGTGATGTCGGCGATAT 
AGGCGCCAGCAACCGCACCTGTGGCGCCGGTGATGCCGGCCACGATGCGTCCGGCGTAGA 
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FIG. 27 

PGenRNR3 

GGATCCAGAAACAAGAGAAGGTAACAAGCACATAAAAAATCAGCACATACGTACATACATAAGAATGAATCG 
CACGCACGCGTAAACATTTATCATTTAATCTTCAGTTGTTAGATAAAAAAAAAAAGAAAAGAAAAGAAAGTG
AAGGCTTGTTTCAGTTTGAACTAGGTAGCAGAGCAAGCCCTCGTTCTTGGCTGCTAATTTTCCTAAAGTAGT 
AAAAAAAGCCAAGTTATCTGCCTACGGTTGTCACAGCAACATTGCGTGCCGTTGTTCTTTTGTTTTTTTTTT 
TTTTTTTTTTTCGTGGTTGTCGCAGCAACGACACCTAGGCGCTGCTCAAAGGGGCAAAAACCCGGTTGCCAT 
GGCGAGGACCAAACGACAAGATGGGAAAAAAACAATAGTCTATTGTTAAATCGTAATACTGTATTGTGAGAT
GCTGACGCGTTTCGTTTTTCGTGTCAGCGTTCTTTATATTGTTTCGTGTTCTGCTGCAAAACGTATATAAAC 
GCACTGCTATTTTGCCTTCTTTTGCCTTCTTCCTTGCCTTTCTCTCATCTCATATCCAAGTTGAAATAAATA 
TGACAAGCAAGAATAGCAGCAGCAATAAATCAAATACTCCCACACAAATGTTAATTAACTCTAAAGGTGAAG 
AATTATTCACTGGTGTTGTCCCAATTTTGGTTGAATTAGATGGTGATGTTAATGGTCACAAATTTTCTGTCT 
CCGGTGAAGGTGAAGGTGATGCTACTTACGGTAAATTGACCTTAAAATTTATTTGTACTACTGGTAAATTGC 
CAGTTCCATGGCCAACCTTAGTCACTAGTTTCGGTTATGGTGTTCAATGTTTTGCGAGATACCCAGATCATA 
TGAAACAACATGACTTTTTCAAGTCTGCCATGCCAGAAGGTTATGTTCAAGAAAGAACTATTTTTTTCAAAG 
ATGACGGTAACTACAAGACCAGAGCTGAAGTCAAGTTTGAAGGTGATACCTTAGTTAATAGAATCGAATTAA 
AAGGTATTGATTTTAAAGAAGATGGTAACATTTTAGGTCACAAATTGGAATACAACTATAACTCTCACAATG
TTTACATCATGGCTGACAAACAAAAGAATGGTATCAAAGTTAACTTCAAAATTAGACACAACATTGAAGATG
GTTCTGTTCAATTAGCTGACCATTATCAACAAAATACTCCAATTGGTGATGGTCCAGTCTTGTTACCAGACA 
ACCATTACTTATCCACTCAATCTGCCTTATCCAAAGATCCAAACGAAAAGAGAGACCACATGGTCTTGTTAG 
AATTTGTTACTGCTGCTGGTATTACCCATGGTATGGATGAATTGTACAAATAACTGCAGGGCGCGCCACTTC 
TAAATAAGCGAATTTCTTATGATTTATGATTTTTATTATTAAATAAGTTATAAAAAAAATAAGTGTATACAA
ATTTTAAAGTGACTCTTAGGTTTTAAAACGAAAATTCTTATTCTTGAGTAACTCTTTCCTGTAGGTCAGGTT 
GCTTTCTCAGGTATAGTATGAGGTCGCTCTTATTGACCACACCTCTACCGGCAGATCCGCTAGGGATAACAG 
GGTAATATAGATCTGCCCGCCGGGAAGGCGAACCCGATCGGATGCATCCTCTCTGCTGCCATGATGCTGAAG 
TTGTCGTTGAACATGGTTGCTGCCGGCGAGGCGGTCGAGCAGGCAGTGCAGGAGGTGTTGGACTCGGGAGTC 
AGAACGGGCGACCTGCTCGGCTCGAGCTCCACTTCGGAGGTTGGCGACGCCATTGCGCTTGCAGTTAAGGAA 
GCCTTGCGCAGGCAATCCGCAGCTGGTCTGAGCTAGCCTCGAGGACCCTTCTCTTTAGACTATTCTACTCTT 
ATGCACGTAAAAAATTCTAGGAAATATGTATTAACTAGGAGTAAAATAACCGGCTAGTGGCATTCATATAGC 
CGTCTGTTTACATCTACATCACACATTTCGAGTGTATATCTCGCAACGTTGGCGTTAAATAGGCAGTCAATG 

GCCCGACCATTCTATGGTGTTTAGGTCGATGCCATCTTTGTACGTTTAGCTTATCGATGATAAGCTGTCAAA
CATGAGAATTCTTGAAGACGAAAGGGCCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATG

GTTTCTTAGACGTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATA 
CATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGT
ATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCAC 
CCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGAT 
CTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTT
CTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCT 
CAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTA 
TGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAG 
GAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAAT 
GAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTA 
ACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGA 
CCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGGTCT 
CGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGT 
CAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGCATTGGTAACTG 
TCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTG
AAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCC 
GTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAA 
CCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTC 
AGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTA 
GCACCGCCTACATACCTCGCTC 
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TGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGAT 
AGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGA 
CCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGG 
ACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGT 
ATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGC 
GGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACA 
TGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACCGCTC 
GCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGGAAGCGGAAGAGCGCCTGATGCGGTATTTTC 
TCCTTACGCATCTGTGCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTGCTCTGATGCCGCAT 
AGTTAAGCCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACACC
CGCTGACGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAG 
CTGCATGTGTCAGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGAGCTTTGAAGAAAAATGCGCCTT 
ATTCAATCTTTGCTATAAAAAATGGCCCAAAATCTCACATTGGAAGACATTTGATGACCTCATTTCTTTCAA 
TGAAGGGCCTAACGGAGTTGACTAATGTTGTGGGAAATTGGAGCGATAAGCGTGCTTCTGCCGTGGCCAGGA 
CAACGTATACTCATCAGATAACAGCAATACCTGATCACTACTTCGCACTAGTTTCTCGGTACTATGCATATG 
ATCCAATATCAAAGGAAATGATAGCATTGAAGGATGAGACTAATCCAATTGAGGAGTGGCAGCATATAGAAC 
AGCTAAAGGGTAGTGCTGAAGGAAGCATACGATACCCCGCATGGAATGGGATAATATCACAGGAGGTACTAG 
ACTACCTTTCATCCTACATAAATAGACGCATATAAGTACGCATTTAAGCATAAACACGCACTATGCCGTTCT 
TCTCATGTATATATATATACAGGCAACACGCAGATATAGGTGCGACGTGAACAGTGAGCTGTATGTGCGCAG 
CTCGCGTTGCATTTTCGGAAGCGCTCGTTTTCGGAAACGCTTTGAAGTTCCTATTCCGAAGTTCCTATTCTC 
TAGAAAGTATAGGAACTTCAGAGCGCTTTTGAAAACCAAAAGCGCTCTGAAGACGCACTTTCAAAAAACCAA 
AAACGCACCGGACTGTAACGAGCTACTAAAATATTGCGAATACCGCTTCCACAAACATTGCTCAAAAGTATC 
TCTTTGCTATATATCTCTGTGCTATATCCCTATATAACCTACCCATCCACCTTTCGCTCCTTGAACTTGCAT 
CTAAACTCGACCTCTACATTTTTTATGTTTATCTCTAGTATTACTCTTTAGACAAAAAAATTGTAGTAAGAA 
CTATTCATAGAGTGAATCGAAAACAATACGAAAATGTAAACATTTCCTATACGTAGTATATAGAGACAAAAT 
AGAAGAAACCGTTCATAATTTTCTGACCAATGAAGAATCATCAACGCTATCACTTTCTGTTCACAAAGTATG
CGCAATCCACATCGGTATAGAATATAATCGGGGATGCCTTTATCTTGAAAAAATGCACCCGCAGCTTCGCTA
GTAATCAGTAAACGCGGGAAGTGGAGTCAGGCTTTTTTTATGGAAGAGAAAATAGACACCAAAGTAGCCTTC
TTCTAACCTTAACGGACCTACAGTGCAAAAAGTTATCAAGAGACTGCATTATAGAGCGCACAAAGGAGAAAA 
AAAGTAATCTAAGATGCTTTGTTAGAAAAATAGCGCTCTCGGGATGCATTTTTGTAGAACAAAAAAGAAGTA 
TAGATTCTTTGTTGGTAAAATAGCGCTCTCGCGTTGCATTTCTGTTCTGTAAAAATGCAGCTCAGATTCTTT 
GTTTGAAAAATTAGCGCTCTCGCGTTGCATTTTTGTTTTACAAAAATGAAGCACAGATTCTTCGTTGGTAAA 
ATAGCGCTTTCGCGTTGCATTTCTGTTCTGTAAAAATGCAGCTCAGATTCTTTGTTTGAAAAATTAGCGCTC 
TCGCGTTGCATTTTTGTTCTACAAAATGAAGCACAGATGCTTCGTTCTGCGGTAAAGCTCATCAGCGTGGTC 
GTGAAGCGATTCACAGATGTCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGT 
CTGGCTTCTGATAAAGCGGGCCATGTTAAGGGCGGTTTTTTCCTGTTTGGTCACTGATGCCTCCGTGTAAGG 
GGGATTTCTGTTCATGGGGGTAATGATACCGATGAAACGAGAGAGGATGCTCACGATACGGGTTACTGATGA 
TGAACATGCCCGGTTACTGGAACGTTGTGAGGGTAAACAACTGGCGGTATGGATGCGGCGGGACCAGAGAAA 
AATCACTCAGGGTCAATGCCAGCGCTTCGTTAATACAGATGTAGGTGTTCCACAGGGTAGCCAGCAGCATCC 
TGCGATGCAGATCCGGAACATAATGGTGCAGGGCGCTGACTTCCGCGTTTCCAGACTTTACGAAACACGGAA
ACCGAAGACCATTCATGTTGTTGCTCAGGTCGCAGACGTTTTGCAGCAGCAGTCGCTTCACGTTCGCTCGCG 
TATCGGTGATTCATTCTGCTAACCAGTAAGGCAACCCCGCCAGCCTAGCCGGGTCCTCAACGACAGGAGCAC 
GATCATGCGCACCCGTGGCCAGGACCCAACGCTGCGGGGGGGGGGGGGGTTTTCTTTCCAATTTTTTTTTTT 
TCGTCATTATAGAAATCATTACGACCGAGATTCCCGGGTAATAACTGATATAATTAAATTGAAGCTCTAATT 
TGTGAGTTTAGTATACATGCATTTACTTATAATACAGTTTTTTAGTTTTGCTGGCCGCATCTTCTCAAATAT 
GCTTCCCAGCCTGCTTTTCTGTAACGTTCACCCTCTACCTTAGCATCCCTTCCCTTTGCAAATAGTCCTCTT
CCAACAATAATAATGTCAGATCCTGTAGAGACCACATCATCCACGGTTCTATACTGTTGACCCAATGCGTCT 
CCCTTGTCATCTAAACCCACACCGGGTGTCATAATCAACCAATCGTAACCTTCATCTCTTCCACCCATGTCT 
CTTTGAGCAATAAAGCCGATAACAAAATCTTTGTCGCTCTTCGCAATGTCAACAGTACCCTTAGTATATTCT 
CCAGTAGCTAGGGAGCCCTTGCATGACAATTCTGCTAACATCAAAAGGCCTCTAGGTTCCTTTGTTACTTCT 
TCCGCCGCCTGCTTCAAACCGCTAACAATACCTGGGCCCACCACACCGTGTGCATTCGTAATGTCTGCCCAT 
TCTGCTATTCTGTATACACCCGCAGAGTACTGCAATTTGACTGTATTACCAATGTCAGCAAATTTTCTGTCT 
TCGAAGAGTAAAAAATTGTACTTGGCGGATAATGCCTTTAGCGGCTTAACTGTGCCCTCCATGGAAAAATCA 
GTCAAGATATCCACATGT
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GTTTTTAGTAAACAAATTTTGGGACCTAATGCTTCAACTAACTCCAGTAATTCCTTGGTGGTACGAACATCC
AATGAAGCACACAAGTTTGTTTGCTTTTCGTGCATGATATTAAATAGCTTGGCAGCAACAGGACTAGGATGA 
GTAGCAGCACGTTCCTTATATGTAGCTTTCGACATGATTTATCTTCGTTTCCTGCAGGTTTTTGTTCTGTGC 
AGTTGGGTTAAGAATACTGGGCAATTTCATGTTTCTTCAACACCACATATGCGTATATATACCAATCTAAGT 
CTGTGCTCCTTCCTTCGTTCTTCCTTCTGCTCGGAGATTACCGAATCAAAAAAATTTCAAAGAAACCGGAAT 
CAAAAAAAAGAACAAAAAAAAAAAAGATGAATTGAAACCCCCCCCCCCCCCGATGCGCCGCGTGCGGCTGCT 
GGAGATGGCGGACGCGATGGATATGTTCTGCCAAGGGTTGGTTTGCGCATTCACAGTTCTCCGCAAGAATTG 
ATTGGCTCCAATTCTTGGAGTGGTGAATCCGTTAGCGAGGTGCCGCCGGCTTCCATTCAGGTCGAGGTGGCC 
CGGCTCCATGCACCGCGACGCAACGCGGGGAGGCAGACAAGGTATAGGGCGGCGCCTACAATCCATGCCAAC 
CCGTTCCATGTGCTCGCCGAGGCGGCATAAATCGCCGTGACGATCAGCGGTCCAGTGATCGAAGTTAGGCTG 
GTAAGAGCCGCGAGCGATCCTTGAAGCTGTCCCTGATGGTCGTCATCTACCTGCCTGGACAGCATGGCCTGC 
AACGCGGGCATCCCGATGCCGCCGGAAGCGAGAAGAATCATAATGGGGAAGGCCATCCAGCCTCGCGTCGCG 
AACGCCAGCAAGACGTAGCCCAGCGCGTCGGCCGCCATGCCGGCGATAATGGCCTGCTTCTCGCCGAAACGT 
TTGGTGGCGGGACCAGTGACGAAGGCTTGAGCGAGGGCGTGCAAGATTCCGAATACCGCAAGCGACAGGCCG 
ATCATCGTCGCGCTCCAGCGAAAGCGGTCCTCGCCGAAAATGACCCAGAGCGCTGCCGGCACCTGTCCTACG 
AGTTGCATGATAAAGAAGACAGTCATAAGTGCGGCGACGATAGTCATGCCCCGCGCCCACCGGAAGGAGCTG 
ACTGGGTTGAAGGCTCTCAAGGGCATCGGTCGACGCTCTCCCTTATGCGACTCCTGCATTAGGAAGCAGCCC 
AGTAGTAGGTTGAGGCCGTTGAGCACCGCCGCCGCAAGGAATGGTGCATGCAAGGAGATGGCGCCCAACAGT 
CCCCCGGCCACGGGGCCTGCCACCATACCCACGCCGAAACAAGCGCTCATGAGCCCGAAGTGGCGAGCCCGA 
TCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCCGGTGATGCCGGCCACG 
ATGCGTCCGGCGTAGA 
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Fig:30 



Methyl methanesulfonate

GENOTOXICITY EVALUATION

[Plot: x-axis "Test Compound Concentration (µg/ml)"; y-axis scale 0.00 to 5.00; series "RNR3-GFP" and "Threshold"]

RNR3-GFP Reporter Response to MMS



WO 2005/012533 



PCT/GB2004/003337 






Fig:31 



1,2-Dimethylhydrazine (dihydrochloride)
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Ethyl methanesulfonate 
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Fig:33 

RNR3 sequence data downloaded from SGD, Chromosome IX: 



1 AGATTCTGCG CCAGCAAGTC GTCTCCGAGG GGGGGCCCAC CGCTACCATA
51 CAGTCAAGGT TTAACTACGC ATGGGGGCTG ATCAAATCCA CTGACGTGAA
101 TGACGAAAGG CTTGGTGTGA AAATCCTCAC AGACATTTAC AAAGAGGCCG
151 AGTCCCGTAG ACGAGAATGC CTATATTATC TGACCATAGG TTGCTACAAA
201 CTCGGTGAAT ACTCTATGGC GAAGAGATAT GTAGACACTT TATTTGAGCA
251 TGAGCGTAAT AACAAGCAGG TGGGCGCTTT GAAGAGTATG GTAGAGGATA
301 AGATCCAGAA GGAAACACTC AAGGGTGTTG TCGTCGCTGG AGGCGTACTA
351 GCCGGCGCTG TGGCCGTGGC TAGTTTCTTC TTAAGAAACA AGAGAAGGTA
401 ACAAGCACAT AAAAAATCAG CACATACGTA CATACATAAG AATGAATCGC
451 ACGCACGCGT AAACATTTAT CATTTAATCT TCAGTTGTTA GATAAAAAAA
501 AAAAGAAAAG AAAAGAAAGT GAAGGCTTGT TTCAGTTTGA ACTAGGTAGC
551 AGAGCAAGCC CTCGTTCTTG GCTGCTAATT TTCCTAAAGT AGTAAAAAAA
601 GCCAAGTTAT CTGCCTACGG TTGTCACAGC AACATTGCGT GCCGTTGTTC
651 TTTTGTTTTT TTTTTTTTTT TTTTTTCGTG GTTGTCGCAG CAACGACACC
701 TAGGCGCTGC TCAAAGGGGC AAAAACCCGG TTGCCATGGC GAGGACCAAA
751 CGACAAGATG GGAAAAAAAC AATAGTCTAT TGTTAAATCG TAATACTGTA
801 TTGTGAGATG CTGACGCGTT TCGTTTTTCG TGTCAGCGTT CTTTATATTG
851 TTTCGTGTTC TGCTGCAAAA CGTATATAAA CGCACTGCTA TTTTGCCTTC
901 TTTTGCCTTC TTCCTTGCCT TTCTCTCATC TCATATCCAA GTTGAAATAA
951 ATATGACAAG CAAGAATAGC AGCAGCAATA AATCAAATAC TCCCACAgAA
1001 ATCTACGTTA TTAAAAGAGA CGGCCGCAAA GAGCCCGTTC AATTCGATAA
1051 AATTACCTCC CGTATCACCC GTTTGTCATA CGGTTTAGAC CCAAACCGTA
1101 TTGATGCTGT TAAGGTAACC CAACGTATTA TTTCTGGTGT GTACTCCGGT
1151 GTTACTACCG TTGAGCTGGA CAATCTTGCA GCTGAAACAT GTGCATACAT
1201 GACCACTGTG CACCCTGATT ATGCCACTCT AGCCGCTAGA ATCGCCATCT
1251 CTAACTTACA TAAGCAAACC ACAAAGCAAT TCTCCAAAGT TATTGAGGAT
1301 TTACACGACT GGATTAACCC AGCTACTGGA AAGCATGCTC CTATGATTTC
1351 GGACGAAATT TACAACATTG TCATGGAAAA CAAAGATACT TTGAACTCGG
1401 CCATCGTGTA CGATAGGGAT TTCCAGTATA CGTATTTCGG ATTCAAGACA
1451 CTGGAGCGTT CGTACTTGCT AAGACTGAAC GGTGAAGTGG CAGAACGTCC
1501 TCAGCATTTG GTAATGCGTG TGGCGCTAGG TATCCATGGT AGCGATATCG
1551 AATCTGTGCT GAAGACTTAT AATTTGATGT CGTTAAGATA CTTCACTCAC
1601 GCTTCCCCAA CTTTATTCAA CGCTGGTACG CCACATCCTC AAATGTCTTC
1651 ATGTTTCTTA ATTGCCATGA AGGATGACTC TATCGAAGGT ATTTATGATA
1701 CTTTGAAAGA ATGTGCTATG ATTTCCAAAA CTGCAGGTGG TGTTGGTCTT
1751 CATATCAACA ACATCCGTTC CACAGGTTCT TATATCGCTG GTACCAACGG
1801 TACTTCAAAC GGGTTGATTC CTATGATTCG TGTTTTCAAT AATACTGCCC
1851 GTTATGTGGA CCAGGGTGGT AACAAGAGAC CTGGTGCTTT CGCCCTTTTC
1901 TTGGAGCCAT GGCATGCAGA TATCTTCGAC TTTGTCGATA TCAGAAAAAC
1951 ACATGGTAAG GAAGAAATTC GTGCAAGAGA TTTGTTCCCT GCTCTATGGA
2001 TCCCTGATCT TTTCATGAAA CGTGTTCAAG AGGATGGGCC TTGGACTTTG
2051 TTTTCGCCCA GTGCTGCCCC AGGTTTAGAT GATGTGTGGG GTGATGAATT
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2101 TGAAGAACTA TATACTCGTT ACGAAAGAGA AGGTCGTGGT AAAACAATTA
2151 AAGCCCAAAA GTTGTGGTAT GCCATTTTGC AAGCACAGAC AGAAACAGGT
2201 ACACCTTTCA TGGTTTATAA GGACGCATGT AACAGGAAGA CAAACCAACA
2251 GAACTTAGGT ACTATCAAAT CATCTAATTT ATGTTGTGAA ATCGTCGAAT
2301 ATTCCTCCCC GGATGAAACT GCAGTTTGTA ATTTAGCTTC TATTGCCCTA
2351 CCAGCATTCG TTGAGGTTTC AGAAGATGGT AAAACTGCAA GCTATAATTT
2401 CGAGAGATTA CACGAGATTG CTAAAGTCAT TACTCACAAC TTGAACAGAG
2451 TTATCGACCG TAATTACTAT CCAGTTCCCG AGGCTAGAAA TTCAAATATG
2501 AAGCATAGAC CTATTGCTCT TGGTGTCCAG GGTTTGGCCG ATACTTATAT
2551 GATGTTGCGT CTACCCTTTG AATCGGAAGA AGCTCAAACT CTAAACAAAC
2601 AAATCTTCGA AACTATTTAC CATGCTACTC TTGAAGCCTC CTGTGAATTG
2651 GCCCAAAAAG AAGGTAAATA TTCTACTTTT GAAGGTTCTC CAGCTTCTAA
2701 GGGTATTTTA CAATTCGATA TGTGGAACGC TAAACCATTT GGCATGTGGG
2751 ATTGGGAAAC CTTAAGAAAG GACATTGTTA AACATGGGTT AAGAAACTCT
2801 TTGACTATGG CACCAATGCC AACCGCCTCA ACTTCCCAAA TTCTTGGTTA
2851 TAATGAATGC TTCGAACCAG TGACCTCAAA CATGTACTCT CGTCGTGTCC
2901 TGTCTGGTGA ATTCCAAGTT GTTAATCCAT ATTTACTACG TGATTTAGTC
2951 GACCTGGGTA TTTGGGATGA TAGTATGAAA CAATATCTAA TTACACAAAA
3001 TGGTTCTATT CAAGGCTTAC CAAATGTGCC ACAAGAATTG AAGGAATTAT
3051 ACAAAACCGT CTGGGAAATC TCTCAAAAGA CCATTATCAA TATGGCTGCT
3101 GATCGTGCCA TCTACATCGA TCAGTCTCAT TCCTTGAATC TTTTCTTGCA
3151 AGCACCATCA ATGGGTAAGA TTACTAGTAT GCATTTCTAC GGTTGGAAGA
3201 AGGGTTTAAA AACTGGTATG TACTACTTAA GAACGCAAGC CGCCTCCGCT
3251 GCTATTCAAT TTACCATTGA TCAAGAGGTT GCCGATCAAG CCGCTACACA
3301 TATTGCTTCC GTCTCAGAAT TGGATCGTCC AGTTTATGTT CCAAAGGGTA
3351 CAAAATTCTC TGAACAAAAG GCGGCATCTG CGCTTACCGA AAGCTCAGAT
3401 AATGAGAAGG ATGCATCTCC AGTTCCATCC GAACAATCAT CGGTGTCGAG
3451 TGCCATGTCA AATGTGAAAT TGGAAGATAG TGTTGCCCCA GCAGTTCCAA
3501 CGGAAACAAT AAAAGAAGAT TCCGACGAGA AGAAATGTGA CATTTACAAT
3551 GAAAAGGTGA TTGCTTGTAC TGCTCCTACT CCAGAAGCTT GTGAGTCATG
3601 TTCCGGTTGA
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Fig:35 

Fragment of HO sequence used in the integrating vector (pWDH443) 

1 AAATTGTGAC AGCTTTCCAG AATGGATTAT TTTTCCTCAA ATTCCTTGTC
51 TTCCTGTTTT CATCTGGACC ATCTCCATAA TGAAGCCTTA CATGTTTGGC
101 ACGTAGCGGA ACGTGATCGT CACAAACCGT AAGGTAGAGA CCCCAGATTT
151 TCGCATTTTC TCTTAAACTC TCCATTAGCT TAGGATCCAA GCTATCTACT
201 GAGATTTCTG GCTCTTTTGT TGTACTGTCA CCTAACCACA GACCAAGCAT
251 CCAAGCCATA CTTTTTACAG CAGGAGTTAC AAGGTCACTA CGTCCAGTGA
301 GAAATTTAGA TAAAACACCA TTTCCTGCGA GTACTGGACC AAATCTTATG
351 CAGCTAGAAA TTCTCAATTG AGCATCAAGA TAATCCAAAT CTCTAACTTC
401 AATGTCAAAG TTGAAATATT CTCCTTTAGA GCGCTCCATT TCTTCTATGA
451 AGCGTTTTGC GGCAAACTCA CCTTCAACTG TCATTGGGAA TGTCTTATGA
501 TGGTTTTTTG GAATTATTAT TATCCTACCA TCAAGCGTCT GACATTGCTG
551 CAGATTTCTC CATCTCACTT TATATTTGGT GGCATTTCTA CCACTTTTTT
601 CCAACAGTGG TTTGGTAGGG ACCCTGACTG ACAATTTATG ACCTGCAGTA
651 CATTGTAATG CAAGACGCTG ATAAACTGTT CTACGCCTGG GATCTAACCT
701 ACCAGGTTCA CCTTCAAAAG CTCTGTGTTT GGTTTTTTGC TGTATATTAT
751 AGATTTTCTG ATAGCCCTGT GTGACATTTA TGACGCGGGC AGCGGAGCCA
801 TCTGCGCACA TAACGTAAGA GTTAGCCGTG ACGTTTGCGA TGTCTTTAAT
851 TTCACCGTTA GCCATCAGAA TAGTCGTGTT TTCAGAAAGC AT
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Fig:37 



rDNA sequence used in multiple copy rDNA integrating plasmids 
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1 GATCTGACGA TCACCTAGCG ACTCTCTCCA CCGTTTGACG AGGCCATTTA
51 CAAAAACATA ACGAACGACA AGCCTACTCG AATTCGTTTC CAAACTCTTT
101 TCGAACTTGT CTTCAACTGC TTTCGCATGA AGTACCTCCC AACTACTTTT
151 CCTCACACTT GTACTCCATG ACTAAACCCC CCCTCCCATT ACAAACTAAA
201 ATCTTACTTT TATTTTCTTT TGCCCTCTCT GTCGCTCTGC CTTAACTACG
251 TATTTCTCGC CGAGAAAAAC TTCAATTTAA GCTATTCTCC AAAAATCTTA
301 GCGTATATTT TTTTTCCAAA GTGACAGGTG CCCCGGGTAA CCCAGTTCCT
351 CACTATTTTT TACTGCGGAA GCGGAAGCGG AAAATACGGA AACGCGCGGG
401 AACATACAAA ACATACAAAA TATACCTTTC TCACACAAGA AATATATGCT
451 ACTTGCAAAA TATCATACCA AAAAACTTTT CACAACCGAA ACCAAAACCA
501 ACGGATATCA TACATTACAC TACCACCATT CAAACTTTAC TACTATCCTC
551 CCTTCAGTTT CCCTTTTTCT GCCTTTTTCG GTGACGGAAA TACGCTTCAG
601 AGACCCTAAA GGGAAATCCA TGCCATAACA GGAAAGTAAC ATCCCAATGC
651 GGACTATACC ACCCCACCAC ACTCCTACCA ATAACGGTAA CTATTCTATG
701 TTTTCTTACT CCTATGTCTA TTCATCTTTC ATCTGACTAC CTAATACTAT
751 GCAAAAATGT AAAATCATCA CACAAAACAT AAACAATCAA AATCAGCCAT
801 TTCCGCACCT TTTCCTCTGT CCACTTTCAA CCGTCCCTCC AAATGTAAAA
851 TGGCCTATCG GAATACATTT TCTACATCCT AACTACTATA AAACAACCTT
901 TAGACTTACG TTTGCTACTC TCATGGTCTC AATACTGCCG CCGACATTCT
951 GTCCCACATA CTAAATCTCT TCCCGTCATT ATCGCCCGCA TCCGGTGCCG
1001 TAAATGCAAA ACAAATACCA TCTATGTCTT CCACACCATC ATTTTACTAT
1051 GCCTGCCACC ATCCATTTGT CTTTTGCACC ATATCTTCAT AACCTGTCAC
1101 CTTGAAACTA CCTCTGCATG CCACCTACCG ACCAACTTTC ATGTTCTGTT
1151 TCGACCTACC TCTTGTAAAT GACAAATCAC CTTTTTCATC GTATGCACCT
1201 TATTCTCCAC ATCACAATGC ACTATTGCTT TTGCTTTTTC ACCTGTCATA
1251 TCCTATTGCT ATTAGATGAA ATATAATAAA AATTGTCCTC CACCCATAAC
1301 ACCTCTCACT CCCACCTACT GAACATGTCT GGACCCTGCC CTCATATCAC
1351 CTGCGTTTCC GTTAAACTAT CGGTTGCGGC CATATCTACC AGAAAGCACC
1401 GTTTCCCGTC CGATCAACTG TAGTTAAGCT GGTAAGAGCC TGACCGAGTA
1451 GTGTAGTGGG TGACCATACG CGAAACTCAG GTGCTGCAAT CTTTATTTCT
1501 TTTTTTTTTT TTTTTTT TTTTTTTTCT AGTTTCTTGG CTTCCTATGC
1551 TAAATCCCAT AACTAACCTA CCATTCGATT CAGAAAAATT CGCACTATCC
1601 AGCTGCACTC TTCTTCTGAA GAGTTAAGCA CTCCATTATG CTCATTGGGT
1651 TGCTACTACT TGATATGTAC AAACAATATT CTCCTCCGAT ATTCCTACAA
1701 AAAAAAAAAA AAAAACACTC CGGTTTTGTT CTCTTCCCTC CATTTCCCTC
1751 TCTTCTACGG TTAATACTTT CCTCTTCGTC TTTTTCTACA CCCTCGTTTA
1801 GTTGCTTCTT ATTCCTTCCC GCTTTCCTGC ACTAACATTT TGCCGCATTA
1851 CACTATATGA TCGTAGTACA TCTTACAACT CCGCATACCG CGTCGCCGCG
1901 TCGCCGCGTC GCCAAAAATT TACTTCGCCA ACCATTCCAT ATCTGTTAAG
1951 TATACATGTA TATATTGCAC TGGCTATTCA TCTTGCACTT TTCCTCTTTC
2001 TTCTTCCCAG TAGCCTCATC CTTTTACGCT GCCTCTCTGG AACTTGCCAT
2051 CATCATTCCC TAGAAACTGC CATTTACTTA AAAAAAAAAA AAAAAAAAAA
2101 ATGTCCCCAC TGTTCACTGT TCACTGTTCA CTTGTCTCTT ACATCTTTCT
2151 TGGTAAAATC GTAGTTCGTA GTATTTTTTT TCATATCAAA GGCATGTCCT
2201 GTTAACTATA GGAAATGAGC TTTTCTCAAT TCTCTAAACT TATACAAGCA
2251 CCTCATGTTT GCCGCTCTGA TGGTGCGGAA AAAACTGCTC CATGAAGCAA
2301 ACTGTCCGGG CAAATCCTTT CACGCTCGGG AAGCTTTGTG AAAGCCCTTC
2351 TCTTTCAACC CATCTTTGCA ACGAAAAAAA AAAAAAAAAT AAAAAATAAA
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2401 AAGACCAAAT AGTAAATAGT AACTTACATA CATTAGTAAA TGGTACACTC
2451 TTACACACTA TCATCCTCAT CGTATATTAT AATAGATATA TACAATACAT
2501 GTTTTTACCC GGATCATAGA ATTCTTAAGA CAAATAAAAT TTATAGAGAC
2551 TTGTTCAGTC TACTTCTCTC TAAACTAGGC CCCGGCTCCT GCCAGTACCC
2601 ACTTAGAAAG AAATAAAAAA CAAATCAGAC AACAAAGGCT TAATCTCAGC
2651 AGATCGTAAC AACAAGGCTA CTCTACTGCT TACAATACCC CGTTGTACAT
2701 CTAAGTCGTA TACAAATGAT TTATCCCCAC GCAAAATGAC ATTGCAATTC
2751 GCCAGCAAGC ACCCAAGGCC TTTCCGCCAA GTGCACCGTT GCTAGCCTGC
2801 TATGGTTCAG CGACGCCACA AGGACGCCTT ATTCGTATCC ATCTATATTG
2851 TGTGGAGCAA AGAAATCACC GCGTTCTAGC ATGGATTCTG ACTTAGAGGC
2901 GTTCAGCCAT AATCCAGCGG ATGGTAGCTT CGCGGCAATG CCTGATCAGA
2951 CAGCCGCAAA AACCAATTAT CCGAATGAAC TGTTCCTCTC GTACTAAGTT
3001 CAATTACTAT TGCGGTAACA TTCATCAGTA GGGTAAAACT AACCTGTCTC
3051 ACGACGGTCT AAACCCAGCT CACGTTCCCT ATTAGTGGGT GAACAATCCA
3101 ACGCTTACCG AATTCTGCTT CGGTATGATA GGAAGAGCCG ACATCGAAGA
3151 ATCAAAAAGC AATGTCGCTA TGAACGCTTG ACTGCCACAA GCCAGTTATC
3201 CCTGTGGTAA CTTTTCTGGC ACCTCTAGCC TCAAATTCCG AGGGACTAAA
3251 GGATCGATAG GCCACACTTT CATGGTTTGT ATTCACACTG AAAATCAAAA
3301 TCAAGGGGGC TTTTACCCTT TTGTTCTACT GGAGATTTCT GTTCTCCATG
3351 AGCCCCCCTT AGGACATCTG CGTTATCGTT TAACAGATGT GCCGCCCCAG
3401 CCAAACTCCC CACCTGACAA TGTCTTCAAC CCGGATCAGC CCCGAATGGG
3451 ACCTTGAATG CTAGAACGTG GAAAATGAAT TCCAGCTCCG CTTCATTGAA
3501 TAAGTAAAGA AACTATAAAG GTAGTGGTAT TTCACTGGCG CCGAAGCTCC
3551 CACTTATTCT ACACCCTCTA TGTCTCTTCA CAATGTCAAA CTAGAGTCAA
3601 GCTCAACAGG GTCTTCTTTC CCCGCTGATT CTGCCAAGCC CGTTCCCTTG
3651 GCTGTGGTTT CGCTAGATAG TAGATAGGGA CAGTGGGAAT CTCGTTAATC
3701 CATTCATGCG CGTCACTAAT TAGATGACGA GGCATTTGGC TACCTTAAGA
3751 GAGTCATAGT TACTCCCGCC GTTTACCCGC GCTTGGTTGA ATTTCTTCAC
3801 TTTGACATTC AGAGCACTGG GCAGAAATCA CATTGCGTCA ACATCACTTT
3851 CTGACCATCG CAATGCTATG TTTTAATTAG ACAGTCAGAT TCCCCTTGTC
3901 CGTACCAGTT CTAAGTTGAT CGTTAATTGT AGCAAGCGAC GGTCTACAAG
3951 AGACCTACCA AGGCCGTCTA CAACAAGGCA CGCAAGTAGT CCGCCTAGCA
4001 GAGCAAGCCC CACCAAGCAG TCCACAAGCA CGCCCGCTGC GTCTGACCAA
4051 GGCCCTCACT ACCCGACCCT TAGAGCCAAT CCTTATCCCG AAGTTACGGA
4101 TCTATTTTGC CGACTTCCCT TATCTACATT ATTCTATCAA CTAGAGGCTG
4151 TTCACCTTGG AGACCTGCTG CGGTTATCAG TACGACCTGG CATGAAAACT
4201 ATTCCTTCCT GTGGATTTTC ACGGGCCGTC ACAAGCGCAC CGGAGCCAGC
4251 AAAGGTGCTG GCCTCTTCCA GCCATAAGAC CCCATCTCCG GATAAACCAA
4301 TTCCGGGGTG ATAAGCTGTT AAGAAGAAAA GATAACTCCT CCCAGGGCTC
4351 GCGCCGACGT CTCCACATTC AGTTACGTTA CCGTGAAGAA TCCATATCCA
4401 GGTTCCGGAA TCTTAACCGG ATTCCCTTTC GATGGTGGCC TGCATAAAAT
4451 CAGGCCTTTG AAACGGAGCT TCCCCATCTC TTAGGATCGA CTAACCCACG
4501 TCCAACTGCT GTTGACGTGG AACCTTTCCC CACTTCAGTC TTCAAAGTTC
4551 TCATTTGAAT ATTTGCTACT ACCACCAAGA TC
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Fig:39 
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Fig:41 P GenIn012 - 7 515 bp 



1 GAACGCGGCC GCCAGCTGAA GCTTCGTACG CTGCAGGTCG ACGGATCAAA 
51 ATTGTGACAG CTTTCCAGAA TGGATTATTT TTCCTCAAAT TCCTTGTCTT 
101 CCTGTTTTCA TCTGGACCAT CTCCATAATG AAGCCTTACA TGTTTGGCAC 
151 GTAGCGGAAC GTGATCGTCA CAAACCGTAA GGTAGAGACC CCAGATTTTC 
201 GCATTTTCTC TTAAACTCTC CATTAGCTTA GGATCCAAGC TATCTACTGA 
251 GATTTCTGGC TCTTTTGTTG TACTGTCACC TAACCACAGA CCAAGCATCC 
301 AAGCCATACT TTTTACAGCA GGAGTTACAA GGTCACTACG TCCAGTGAGA 
351 AATTTAGATA AAACACCATT TCCTGCGAGT ACTGGACCAA ATCTTATGCA 
401 GCTAGAAATT CTCAATTGAG CATCAAGATA ATCCAAATCT CTAACTTCAA 
451 TGTCAAAGTT GAAATATTCT CCTTTAGAGC GCTCCATTTC TTCTATGAAG 
501 CGTTTTGCGG CAAACTCACC TTCAACTGTC ATTGGGAATG TCTTATGATG 
551 GTTTTTTGGA ATTATTATTA TCCTACCATC AAGCGTCTGA CATTGCTGCA 
601 GATTTCTCCA TCTCACTTTA TATTTGGTGG CATTTCTACC ACTTTTTTCC 
651 AACAGTGGTT TGGTAGGGAC CCTGACTGAC AATTTATGAC CTGCAGTACA 
701 TTGTAATGCA AGACGCTGAT AAACTGTTCT ACGCCTGGGA TCTAACCTAC 
751 CAGGTTCACC TTCAAAAGCT CTGTGTTTGG TTTTTTGCTG TATATTATAG 
801 ATTTTCTGAT AGCCCTGTGT GACATTTATG ACGCGGGCAG CGGAGCCATC 
851 TGCGCACATA ACGTAAGAGT TAGCCGTGAC GTTTGCGATG TCTTTAATTT 
901 CACCGTTAGC CATCAGAATA GTCGTGTTTT CAGAAAGCAT TTTGATCCGA 
951 CATACGATGA CCTCAATGAT TTAGATTATG TGTTGCACTT TTATAGACCT 
1001 ACCAAAAATC CAGTGCGTAC ACTAATACTT TCATAAAGAT ACCTGAAACA 
1051 ATAACCAGAA AGATCGGCAA AAAAATTTTT TTTCTTTGCC GAGATCACAA 
1101 ACCTACTATG ACGAAAAAGC TTGAAGTTTA GATGAGTAAG GAAAATACAA 
1151 GTGACGCTTT TATATGGTGC AAGGAACAAA AACTAAAAAC AACAAGGCAA 
1201 ATGTGGATCT GTCATGTATG GCAACGACAG CAGGATGGCT CACAAAAAAA 
1251 GACAAAAAAA ACTAAGGCAA AAGAACAAAG CTCCTCTCCT GCTCAAGAAA 
1301 CGTATTGTTG AAAAACCACC GTCGTAAGAA AGTTTTTCTG TGACCTATAA 
1351 TGGTTTAAAA TCGGCCCATT TTTTTTCCCT CTTTTGTGGT CCAGTCTTTC 
1401 TCATACTCGA GGGAAATTCG ACACAAACAG CGGAGAAGTG TGGCTAAACC 
1451 GGCAAGTGCC TGCAAGATCC ACAGAACTAA CCGCACGAAC TGGCGGTCAG 
1501 AAAAGAGCCT GTTCCGGAAA GAGAGAAACA GAGAAACGAT CATGATGGGA 
1551 AAGCGGGGAT TCGGCGAAGA ACGAGACTGG AAAGGGAAAA AGAGAAATAC 
1601 TGGTGGAAGT ATTCGGACCT TTGGCGAAGT CCGAACCCTT GAAACCCAAA 
1651 GATGATCGAT GATTCATTTT TCAATGCGCT ACGGTTCCTG CCGCTCGTGG 
1701 GAACCCCACG CAAAACATAT TATTCGCTTC TCTCTGCTGA CAACTCCGGT 
1751 TTACGTTATA CCGTATTAGG ATCACTATAA GGGTTCCTTC GGGAGGAGGG 
1801 GGGAGGGGAA GAATGTACAT CGTCATAAGG CCTTTATGGT GTGAAGTGGG 
1851 TTTTGCGTGG AAAATTCGTT TTCAATGATA TAGAGCCCAC GCATATACGT 
1901 ACATACTAGT GGCCAAAAGC GTGGGGTGGG CGGACAAAGC TACACTGGTA 
1951 AAATACAGGA TTCTATGAAC AATAACAACA ACCAGCTCAC GTTGCTGAAC 
2001 AGCCGAGGTC AGCCGATGCA ACCGAGGTTT CCAAAGTAGC ATTTCTGTGC 
2051 TAGCTATGTC TGTAGGTTTA CATTTAATGG TGCGTGGTTC CAGCTTCATG 
2101 TGCTTGCATG TGATGTCCTG CAGATGGTAA GAAGATTCTG AAAGCCGCGC 
2151 TAGGAGAAAA ATATTCTGCT CGAAGATCTG TCCTCTTAAG TAGAAAGCGT 
2201 GAAATTGTTG CGTTCTTGCA TTACTACTCA ACGCGTACGC AAATGCGTCT 
2251 ACTGCACCTG CATGATAAAG CTTATGTATC AAAAATTTAA CATCTTGAAA 
2301 ATACACAAGT GGTGCAAAGA TGTGTCACGT TCTGGACCTG AGTGGTGCCA 
2351 TGTATGCTAT TTAACATGCA AAGGGGAAGA CCCTTCCGCC TTACTGCAAT 
2401 AATAAAAAGT ATTTTACGCG TTACCCAATA TAGCAAAGTT TCGCGCAAAA 
2451 AAAAAAATAA AAAACAATTA CAAACAAAAA GAAAAAAAAG GAAATAATAG 
2501 AAGATCTAAC TGAAGCGAAG GCCAAAACTC TTCTCACTTG ACGTAATAGC 
2551 CGATACAAAA TCTAGAGCAG CAACTTTTCT CTTTCTTCAC TAAAGCTGCT 
2601 ACGAAAGTAT AGAAAAATCA AACGCTCAGA ACTTAGCTCT ATTTCAAGGT 
2651 ACCATATATA TTTCCTTATA ACTGATGTTA ATTAACTCTA AAGGTGAAGA 
2701 ATTATTCACT GGTGTTGTCC CAATTTTGGT TGAATTAGAT GGTGATGTTA 
2751 ATGGTCACAA ATTTTCTGTC TCCGGTGAAG GTGAAGGTGA TGCTACTTAC 
2801 GGTAAATTGA CCTTAAAATT TATTTGTACT ACTGGTAAAT TGCCAGTTCC 
2851 ATGGCCAACC TTAGTCACTA CTTTCGGTTA TGGTGTTCAA TGTTTTGCGA 
2901 GATACCCAGA TCATATGAAA CAACATGACT TTTTCAAGTC TGCCATGCCA 
2951 GAAGGTTATG TTCAAGAAAG AACTATTTTT TTCAAAGATG ACGGTAACTA 
3001 CAAGACCAGA GCTGAAGTCA AGTTTGAAGG TGATACCTTA GTTAATAGAA 
3051 TCGAATTAAA AGGTATTGAT TTTAAAGAAG ATGGTAACAT TTTAGGTCAC 
3101 AAATTGGAAT ACAACTATAA CTCTCACAAT GTTTACATCA TGGCTGACAA 
3151 ACAAAAGAAT GGTATCAAAG TTAACTTCAA AATTAGACAC AACATTGAAG 
3201 ATGGTTCTGT TCAATTAGCT GACCATTATC AACAAAATAC TCCAATTGGT 
3251 GATGGTCCAG TCTTGTTACC AGACAACCAT TACTTATCCA CTCAATCTGC 
3301 CTTATCCAAA GATCCAAACG AAAAGAGAGA CCACATGGTC TTGTTAGAAT 
3351 TTGTTACTGC TGCTGGTATT ACCCATGGTA TGGATGAATT GTACAAATAA 
3401 CTGCAGGGCG CGCCACTTCT AAATAAGCGA ATTTCTTATG ATTTATGATT 
3451 TTTATTATTA AATAAGTTAT AAAAAAAATA AGTGTATACA AATTTTAAAG 
3501 TGACTCTTAG GTTTTAAAAC GAAAATTCTT ATTCTTGAGT AACTCTTTCC 
3551 TGTAGGTCAG GTTGCTTTCT CAGGTATAGT ATGAGGTCGC TCTTATTGAC 
3601 CACACCTCTA CCGGCAGATC CGCTAGGGAT AACAGGGTAA TATAGATCTG 
3651 CCCGCCGGGA AGGCGAACCC GATCGGATGC ATCCTCTCTG CTGCCATGAT 
3701 GCTGAAGTTG TCGTTGAACA TGGTTGCTGC CGGCGAGGCG GTCGAGCAGG 
3751 CAGTGCAGGA GGTGTTGGAC TCGGGAGTCA GAACGGGCGA CCTGCTCGGC 
3801 TCGAGCTCGA ATTCATCGAT GATATCAGAT CCACTAGTGG CCTATGCGGC 
3851 CGCGGATCTG CCGGTCTCCC TATAGTGAGT CGTATTAATT TCGATAAGCC 
3901 AGGTTAACCT GCATTAATGA ATCGGCCAAC GCGCGGGGAG AGGCGGTTTG 
3951 CGTATTGGGC GCTCTTCCGC TTCCTCGCTC ACTGACTCGC TGCGCTCGGT 
4001 CGTTCGGCTG CGGCGAGCGG TATCAGCTCA CTCAAAGGCG GTAATACGGT 
4051 TATCCACAGA ATCAGGGGAT AACGCAGGAA AGAACATGTG AGCAAAAGGC 
4101 CAGCAAAAGG CCAGGAACCG TAAAAAGGCC GCGTTGCTGG CGTTTTTCCA 
4151 TAGGCTCCGC CCCCCTGACG AGCATCACAA AAATCGACGC TCAAGTCAGA 
4201 GGTGGCGAAA CCCGACAGGA CTATAAAGAT ACCAGGCGTT TCCCCCTGGA 
4251 AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA CCGGATACCT 
4301 GTCCGCCTTT CTCCCTTCGG GAAGCGTGGC GCTTTCTCAA TGCTCACGCT 
4351 GTAGGTATCT CAGTTCGGTG TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG 
4401 CACGAACCCC CCGTTCAGCC CGACCGCTGC GCCTTATCCG GTAACTATCG 
4451 TCTTGAGTCC AACCCGGTAA GACACGACTT ATCGCCACTG GCAGCAGCCA 
4501 CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC TACAGAGTTC 
4551 TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGGACAG TATTTGGTAT 
4601 CTGCGCTCTG CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT 
4651 GATCCGGCAA ACAAACCACC GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG 
4701 CAGCAGATTA CGCGCAGAAA AAAAGGATCT CAAGAAGATC CTTTGATCTT 
4751 TTCTACGGGG TCTGACGCTC AGTGGAACGA AAACTCACGT TAAGGGATTT 
4801 TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT TTTAAATTAA 
4851 AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA CTTGGTCTGA 
4901 CAGTTACCAA TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT 
4951 TTCGTTCATC CATAGTTGCC TGACTCCCCG TCGTGTAGAT AACTACGATA 
5001 CGGGAGGGCT TACCATCTGG CCCCAGTGCT GCAATGATAC CGCGAGACCC 
5051 ACGCTCACCG GCTCCAGATT TATCAGCAAT AAACCAGCCA GCCGGAAGGG 
5101 CCGAGCGCAG AAGTGGTCCT GCAACTTTAT CCGCCTCCAT CCAGTCTATT 
5151 AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG 
5201 CAACGTTGTT GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG 
5251 GTATGGCTTC ATTCAGCTCC GGTTCCCAAC GATCAAGGCG AGTTAGATGA 
5301 TCCCCCATGT TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC CTCCGATCGT 
5351 TGTCAGAAGT AAGTTGGCCG CAGTGTTATC ACTCATGGTT ATGGCAGCAC 
5401 TGCATAATTC TCTTACTGTC ATGCCATCCG TAAGATGCTT TTCTGTGACT 
5451 GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC GGCGACCGAG 
5501 TTGCTCTTGC CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA 
5551 CTTTAAAAGT GCTCATCATT GGAAAACGTT CTTCGGGGCG AAAACTCTCA 
5601 AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAACCCA CTCGTGCACC 
5651 CAACTGATCT TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA 
5701 AAACAGGAAG GCAAAATGCC GCAAAAAAGG GAATAAGGGC GACACGGAAA 
5751 TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA GCATTTATCA 
5801 GGGTTATTGT CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA 
5851 AACAAATAGG GGTTCCGCGC ACATTTCCCC GAAAAGTGCC ACCTGACGTC 
5901 GAATATCATT GAGAAGCTGC ATTTTTTTTT TTTTTTTTTT TTTTTTTTTT 
5951 TATATATATT TCAAGGATAT ACCATTGTAA TGTCTGCCCC TAAGAAGATC 
6001 GTCGTTTTGC CAGGTGACCA CGTTGGTCAA GAAATCACAG CCGAAGCCAT 
6051 TAAGGTTCTT AAAGCTATTT CTGATGTTCG TTCCAATGTC AAGTTCGATT 
6101 TCGAAAATCA TTTAATTGGT GGTGCTGCTA TCGATGCTAC AGGTGTTCCA 
6151 CTTCCAGATG AGGCGCTGGA AGCCTCCAAG AAGGCTGATG CCGTTTTGTT 
6201 AGGTGCTGTG GGTGGTCCTA AATGGGGTAC CGGTAGTGTT AGACCTGAAC 
6251 AAGGTTTACT AAAAATCCGT AAAGAACTTC AATTGTACGC CAACTTAAGA 
6301 CCATGTAACT TTGCATCCGA CTCTCTTTTA GACTTATCTC CAATCAAGCC 
6351 ACAATTTGCT AAAGGTACTG ACTTCGTTGT TGTCAGAGAA TTAGTGGGAG 
6401 GTATTTACTT TGGTAAGAGA AAGGAAGACG ATGGTGATGG TGTCGCTTGG 
6451 GATAGTGAAC AATACACCGT TCCAGAAGTG CAAAGAATCA CAAGAATGGC 
6501 CGCTTTCATG GCCCTACAAC ATGAGCCACC ATTGCCTATT TGGTCCTTGG 
6551 ATAAAGCTAA TGTTTTGGCC TCTTCAAGAT TATGGAGAAA AACTGTGGAG 
6601 GAAACCATCA AGAACGAATT CCCTACATTG AAGGTTCAAC ATCAATTGAT 
6651 TGATTCTGCC GCCATGATCC TAGTTAAGAA CCCAACCCAC CTAAATGGTA 
6701 TTATAATCAC CAGCAACATG TTTGGTGATA TCATCTCCGA TGAAGCCTCC 
6751 GTTATCCCAG GTTCCTTGGG TTTGTTGCCA TCTGCGTCCT TGGCCTCTTT 
6801 GCCAGACAAG AACACCGCAT TTGGTTTGTA CGAACCATGC CACGGTTCTG 
6851 CTCCAGATTT GCCAAAGAAT AAGGTCAACC CTATCGCCAC TATCTTGTCT 
6901 GCTGCAATGA TGTTGAAATT GTCATTGAAC TTGCCTGAAG AAGGTAAGGC 
6951 CATTGAAGAT GCAGTTAAAA AGGTTTTGGA TGCAGGTATC AGAACTGGTG 
7001 ATTTAGGTGG TTCCAACAGT ACCACCGAAG TCGGTGATGC TGTCGCCGAA 
7051 GAAGTTAAGA AAATCCTTGC TTAAAAAGAT TCTCTTTTTT TATGATATTT 
7101 GTACAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
7151 AAAATGCAGC GTCACATCGG ATAATAATGA CGTCTAAGAA ACCATTATTA 
7201 TCATGACATT AACCTATAAA AATAGGCGTA TCACGAGGCC CTTTCGTCTC 
7251 GCGCGTTTCG GTGATGACGG TGAAAACCTC TGACACATGC AGCTCCCGGA 
7301 GACGGTCACA GCTTGTCTGT AAGCGGATGC CGGGAGCAGA CAAGCCCGTC 
7351 AGGGCGCGTC AGCGGGTGTT GGCGGGTGTC GGGGCTGGCT TAACTATGCG 
7401 GCATCAGAGC AGATTGTACT GAGAGTGCAC CATATGGACA TATTGTCGTT 
7451 AGAACGCGGC TACAATTAAT ACATAACCTT ATGTATCATA CACATACGAT 
7501 TTAGGTGACA CTATA 
Fig:42 pGenIn022A - 12 093 bp 

1 GAACGCGGCC GCCAGCTGAA GCTTCGTACG CTGCAGGTCG ACGGATCAAA 
51 ATTGTGACAG CTTTCCAGAA TGGATTATTT TTCCTCAAAT TCCTTGTCTT 
101 CCTGTTTTCA TCTGGACCAT CTCCATAATG AAGCCTTACA TGTTTGGCAC 
151 GTAGCGGAAC GTGATCGTCA CAAACCGTAA GGTAGAGACC CCAGATTTTC 
201 GCATTTTCTC TTAAACTCTC CATTAGCTTA GGATCTGACG ATCACCTAGC 
251 GACTCTCTCC ACCGTTTGAC GAGGCCATTT ACAAAAACAT AACGAACGAC 
301 AAGCCTACTC GAATTCGTTT CCAAACTCTT TTCGAACTTG TCTTCAACTG 
351 CTTTCGCATG AAGTACCTCC CAACTACTTT TCCTCACACT TGTACTCCAT 
401 GACTAAACCC CCCCTCCCAT TACAAACTAA AATCTTACTT TTATTTTCTT 
451 TTGCCCTCTC TGTCGCTCTG CCTTAACTAC GTATTTCTCG CCGAGAAAAA 
501 CTTCAATTTA AGCTATTCTC CAAAAATCTT AGCGTATATT TTTTTTCCAA 
551 AGTGACAGGT GCCCCGGGTA ACCCAGTTCC TCACTATTTT TTACTGCGGA 
601 AGCGGAAGCG GAAAATACGG AAACGCGCGG GAACATACAA AACATACAAA 
651 ATATACCTTT CTCACACAAG AAATATATGC TACTTGCAAA ATATCATACC 
701 AAAAAACTTT TCACAACCGA AACCAAAACC AACGGATATC ATACATTACA 
751 CTACCACCAT TCAAACTTTA CTACTATCCT CCCTTCAGTT TCCCTTTTTC 
801 TGCCTTTTTC GGTGACGGAA ATACGCTTCA GAGACCCTAA AGGGAAATCC 
851 ATGCCATAAC AGGAAAGTAA CATCCCAATG CGGACTATAC CACCCCACCA 
901 CACTCCTACC AATAACGGTA ACTATTCTAT GTTTTCTTAC TCCTATGTCT 
951 ATTCATCTTT CATCTGACTA CCTAATACTA TGCAAAAATG TAAAATCATC 
1001 ACACAAAACA TAAACAATCA AAATCAGCCA TTTCCGCACC TTTTCCTCTG 
1051 TCCACTTTCA ACCGTCCCTC CAAATGTAAA ATGGCCTATC GGAATACATT 
1101 TTCTACATCC TAACTACTAT AAAACAACCT TTAGACTTAC GTTTGCTACT 
1151 CTCATGGTCT CAATACTGCC GCCGACATTC TGTCCCACAT ACTAAATCTC 
1201 TTCCCGTCAT TATCGCCCGC ATCCGGTGCC GTAAATGCAA AACAAATACC 
1251 ATCTATGTCT TCCACACCAT CATTTTACTA TGCCTGCCAC CATCCATTTG 
1301 TCTTTTGCAC CATATCTTCA TAACCTGTCA CCTTGAAACT ACCTCTGCAT 
1351 GCCACCTACC GACCAACTTT CATGTTCTGT TTCGACCTAC CTCTTGTAAA 
1401 TGACAAATCA CCTTTTTCAT CGTATGCACC TTATTCTCCA CATCACAATG 
1451 CACTATTGCT TTTGCTTTTT CACCTGTCAT ATCCTATTGC TATTAGATGA 
1501 AATATAATAA AAATTGTCCT CCACCCATAA CACCTCTCAC TCCCACCTAC 
1551 TGAACATGTC TGGACCCTGC CCTCATATCA CCTGCGTTTC CGTTAAACTA 
1601 TCGGTTGCGG CCATATCTAC CAGAAAGCAC CGTTTCCCGT CCGATCAACT 
1651 GTAGTTAAGC TGGTAAGAGC CTGACCGAGT AGTGTAGTGG GTGACCATAC 
1701 GCGAAACTCA GGTGCTGCAA TCTTTATTTC TTTTTTTTTT TTTTTTTTTT 
1751 TTTTTTTTTC TAGTTTCTTG GCTTCCTATG CTAAATCCCA TAACTAACCT 
1801 ACCATTCGAT TCAGAAAAAT TCGCACTATC CAGCTGCACT CTTCTTCTGA 
1851 AGAGTTAAGC ACTCCATTAT GCTCATTGGG TTGCTACTAC TTGATATGTA 
1901 CAAACAATAT TCTCCTCCGA TATTCCTACA AAAAAAAAAA AAAAAACACT 
1951 CCGGTTTTGT TCTCTTCCCT CCATTTCCCT CTCTTCTACG GTTAATACTT 
2001 TCCTCTTCGT CTTTTTCTAC ACCCTCGTTT AGTTGCTTCT TATTCCTTCC 
2051 CGCTTTCCTG CACTAACATT TTGCCGCATT ACACTATATG ATCGTAGTAC 
2101 ATCTTACAAC TCCGCATACC GCGTCGCCGC GTCGCCGCGT CGCCAAAAAT 
2151 TTACTTCGCC AACCATTCCA TATCTGTTAA GTATACATGT ATATATTGCA 
2201 CTGGCTATTC ATCTTGCACT TTTCCTCTTT CTTCTTCCCA GTAGCCTCAT 
2251 CCTTTTACGC TGCCTCTCTG GAACTTGCCA TCATCATTCC CTAGAAACTG 
2301 CCATTTACTT AAAAAAAAAA AAAAAAAAAA AATGTCCCCA CTGTTCACTG 
2351 TTCACTGTTC ACTTGTCTCT TACATCTTTC TTGGTAAAAT CGTAGTTCGT 
2401 AGTATTTTTT TTCATATCAA AGGCATGTCC TGTTAACTAT AGGAAATGAG 
2451 CTTTTCTCAA TTCTCTAAAC TTATACAAGC ACCTCATGTT TGCCGCTCTG 
2501 ATGGTGCGGA AAAAACTGCT CCATGAAGCA AACTGTCCGG GCAAATCCTT 
2551 TCACGCTCGG GAAGCTTTGT GAAAGCCCTT CTCTTTCAAC CCATCTTTGC 
2601 AACGAAAAAA AAAAAAAAAA TAAAAAATAA AAAGACCAAA TAGTAAATAG 
2651 TAACTTACAT ACATTAGTAA ATGGTACACT CTTACACACT ATCATCCTCA 
2701 TCGTATATTA TAATAGATAT ATACAATACA TGTTTTTACC CGGATCATAG 
2751 AATTCTTAAG ACAAATAAAA TTTATAGAGA CTTGTTCAGT CTACTTCTCT 
2801 CTAAACTAGG CCCCGGCTCC TGCCAGTACC CACTTAGAAA GAAATAAAAA 
2851 ACAAATCAGA CAACAAAGGC TTAATCTCAG CAGATCGTAA CAACAAGGCT 
2901 ACTCTACTGC TTACAATACC CCGTTGTACA TCTAAGTCGT ATACAAATGA 
2951 TTTATCCCCA CGCAAAATGA CATTGCAATT CGCCAGCAAG CACCCAAGGC 
3001 CTTTCCGCCA AGTGCACCGT TGCTAGCCTG CTATGGTTCA GCGACGCCAC 
3051 AAGGACGCCT TATTCGTATC CATCTATATT GTGTGGAGCA AAGAAATCAC 
3101 CGCGTTCTAG CATGGATTCT GACTTAGAGG CGTTCAGCCA TAATCCAGCG 
3151 GATGGTAGCT TCGCGGCAAT GCCTGATCAG ACAGCCGCAA AAACCAATTA 
3201 TCCGAATGAA CTGTTCCTCT CGTACTAAGT TCAATTACTA TTGCGGTAAC 
3251 ATTCATCAGT AGGGTAAAAC TAACCTGTCT CACGACGGTC TAAACCCAGC 
3301 TCACGTTCCC TATTAGTGGG TGAACAATCC AACGCTTACC GAATTCTGCT 
3351 TCGGTATGAT AGGAAGAGCC GACATCGAAG AATCAAAAAG CAATGTCGCT 
3401 ATGAACGCTT GACTGCCACA AGCCAGTTAT CCCTGTGGTA ACTTTTCTGG 
3451 CACCTCTAGC CTCAAATTCC GAGGGACTAA AGGATCGATA GGCCACACTT 
3501 TCATGGTTTG TATTCACACT GAAAATCAAA ATCAAGGGGG CTTTTACCCT 
3551 TTTGTTCTAC TGGAGATTTC TGTTCTCCAT GAGCCCCCCT TAGGACATCT 
3601 GCGTTATCGT TTAACAGATG TGCCGCCCCA GCCAAACTCC CCACCTGACA 
3651 ATGTCTTCAA CCCGGATCAG CCCCGAATGG GACCTTGAAT GCTAGAACGT 
3701 GGAAAATGAA TTCCAGCTCC GCTTCATTGA ATAAGTAAAG AAACTATAAA 
3751 GGTAGTGGTA TTTCACTGGC GCCGAAGCTC CCACTTATTC TACACCCTCT 
3801 ATGTCTCTTC ACAATGTCAA ACTAGAGTCA AGCTCAACAG GGTCTTCTTT 
3851 CCCCGCTGAT TCTGCCAAGC CCGTTCCCTT GGCTGTGGTT TCGCTAGATA 
3901 GTAGATAGGG ACAGTGGGAA TCTCGTTAAT CCATTCATGC GCGTCACTAA 
3951 TTAGATGACG AGGCATTTGG CTACCTTAAG AGAGTCATAG TTACTCCCGC 
4001 CGTTTACCCG CGCTTGGTTG AATTTCTTCA CTTTGACATT CAGAGCACTG 
4051 GGCAGAAATC ACATTGCGTC AACATCACTT TCTGACCATC GCAATGCTAT 
4101 GTTTTAATTA GACAGTCAGA TTCCCCTTGT CCGTACCAGT TCTAAGTTGA 
4151 TCGTTAATTG TAGCAAGCGA CGGTCTACAA GAGACCTACC AAGGCCGTCT 
4201 ACAACAAGGC ACGCAAGTAG TCCGCCTAGC AGAGCAAGCC CCACCAAGCA 
4251 GTCCACAAGC ACGCCCGCTG CGTCTGACCA AGGCCCTCAC TACCCGACCC 
4301 TTAGAGCCAA TCCTTATCCC GAAGTTACGG ATCTATTTTG CCGACTTCCC 
4351 TTATCTACAT TATTCTATCA ACTAGAGGCT GTTCACCTTG GAGACCTGCT 
4401 GCGGTTATCA GTACGACCTG GCATGAAAAC TATTCCTTCC TGTGGATTTT 
4451 CACGGGCCGT CACAAGCGCA CCGGAGCCAG CAAAGGTGCT GGCCTCTTCC 
4501 AGCCATAAGA CCCCATCTCC GGATAAACCA ATTCCGGGGT GATAAGCTGT 
4551 TAAGAAGAAA AGATAACTCC TCCCAGGGCT CGCGCCGACG TCTCCACATT 
4601 CAGTTACGTT ACCGTGAAGA ATCCATATCC AGGTTCCGGA ATCTTAACCG 
4651 GATTCCCTTT CGATGGTGGC CTGCATAAAA TCAGGCCTTT GAAACGGAGC 
4701 TTCCCCATCT CTTAGGATCG ACTAACCCAC GTCCAACTGC TGTTGACGTG 
4751 GAACCTTTCC CCACTTCAGT CTTCAAAGTT CTCATTTGAA TATTTGCTAC 
4801 TACCACCAAG ATCCAAGCTA TCTACTGAGA TTTCTGGCTC TTTTGTTGTA 
4851 CTGTCACCTA ACCACAGACC AAGCATCCAA GCCATACTTT TTACAGCAGG 
4901 AGTTACAAGG TCACTACGTC CAGTGAGAAA TTTAGATAAA ACACCATTTC 
4951 CTGCGAGTAC TGGACCAAAT CTTATGCAGC TAGAAATTCT CAATTGAGCA 
5001 TCAAGATAAT CCAAATCTCT AACTTCAATG TCAAAGTTGA AATATTCTCC 
5051 TTTAGAGCGC TCCATTTCTT CTATGAAGCG TTTTGCGGCA AACTCACCTT 
5101 CAACTGTCAT TGGGAATGTC TTATGATGGT TTTTTGGAAT TATTATTATC 
5151 CTACCATCAA GCGTCTGACA TTGCTGCAGA TTTCTCCATC TCACTTTATA 
5201 TTTGGTGGCA TTTCTACCAC TTTTTTCCAA CAGTGGTTTG GTAGGGACCC 
5251 TGACTGACAA TTTATGACCT GCAGTACATT GTAATGCAAG ACGCTGATAA 
5301 ACTGTTCTAC GCCTGGGATC TAACCTACCA GGTTCACCTT CAAAAGCTCT 
5351 GTGTTTGGTT TTTTGCTGTA TATTATAGAT TTTCTGATAG CCCTGTGTGA 
5401 CATTTATGAC GCGGGCAGCG GAGCCATCTG CGCACATAAC GTAAGAGTTA 
5451 GCCGTGACGT TTGCGATGTC TTTAATTTCA CCGTTAGCCA TCAGAATAGT 
5501 CGTGTTTTCA GAAAGCATTT TGATCCGACA TACGATGACC TCAATGATTT 
5551 AGATTATGTG TTGCACTTTT ATAGACCTAG CAAAAATCCA GTGCGTACAC 
5601 TAATACTTTC ATAAAGATAC CTGAAACAAT AACCAGAAAG ATCGGCAAAA 
5651 AAATTTTTTT TCTTTGCCGA GATCACAAAC CTACTATGAC GAAAAAGCTT 
5701 GAAGTTTAGA TGAGTAAGGA AAATACAAGT GACGCTTTTA TATGGTGCAA 
5751 GGAACAAAAA CTAAAAACAA CAAGGCAAAT GTGGATCTGT CATGTATGGC 
5801 AACGACAGCA GGATGGCTCA CAAAAAAAGA CAAAAAAAAC TAAGGCAAAA 
5851 GAACAAAGCT CCTCTCCTGC TCAAGAAACG TATTGTTGAA AAACCACCGT 
5901 CGTAAGAAAG TTTTTCTGTG ACCTATAATG GTTTAAAATC GGCCCATTTT 
5951 TTTTCCCTCT TTTGTGGTCC AGTCTTTCTC ATACTCGAGG GAAATTCGAC 
6001 ACAAACAGCG GAGAAGTGTG GCTAAACCGG CAAGTGCCTG CAAGATCCAC 
6051 AGAACTAACC GCACGAACTG GCGGTCAGAA AAGAGCCTGT TCCGGAAAGA 
6101 GAGAAACAGA GAAACGATCA TGATGGGAAA GCGGGGATTC GGCGAAGAAC 
6151 GAGACTGGAA AGGGAAAAAG AGAAATACTG GTGGAAGTAT TCGGACCTTT 
6201 GGCGAAGTCC GAACCCTTGA AACCCAAAGA TGATCGATGA TTCATTTTTC 
6251 AATGCGCTAC GGTTCCTGCC GCTCGTGGGA ACCCCACGCA AAACATATTA 
6301 TTCGCTTCTC TCTGCTGACA ACTCCGGTTT ACGTTATACC GTATTAGGAT 
6351 CACTATAAGG GTTCCTTCGG GAGGAGGGGG GAGGGGAAGA ATGTACATCG 
6401 TCATAAGGCC TTTATGGTGT GAAGTGGGTT TTGCGTGGAA AATTCGTTTT 
6451 CAATGATATA GAGCCCACGC ATATACGTAC ATACTAGTGG CCAAAAGCGT 
6501 GGGGTGGGCG GACAAAGCTA CACTGGTAAA ATACAGGATT CTATGAACAA 
6551 TAACAACAAC CAGCTCACGT TGCTGAACAG CCGAGGTCAG CCGATGCAAC 
6601 CGAGGTTTCC AAAGTAGCAT TTCTGTGCTA GCTATGTCTG TAGGTTTACA 
6651 TTTAATGGTG CGTGGTTCCA GCTTCATGTG CTTGCATGTG ATGTCCTGCA 
6701 GATGGTAAGA AGATTCTGAA AGCCGCGCTA GGAGAAAAAT ATTCTGCTCG 
6751 AAGATCTGTC CTCTTAAGTA GAAAGCGTGA AATTGTTGCG TTCTTGCATT 
6801 ACTACTCAAC GCGTACGCAA ATGCGTCTAC TGCACCTGCA TGATAAAGCT 
6851 TATGTATCAA AAATTTAACA TCTTGAAAAT ACACAAGTGG TGCAAAGATG 
6901 TGTCACGTTC TGGACCTGAG TGGTGCCATG TATGCTATTT AACATGCAAA 
6951 GGGGAAGACC CTTCCGCCTT ACTGCAATAA TAAAAAGTAT TTTACGCGTT 
7001 ACCCAATATA GCAAAGTTTC GCGCAAAAAA AAAAATAAAA AACAATTACA 
7051 AACAAAAAGA AAAAAAAGGA AATAATAGAA GATCTAACTG AAGCGAAGGC 
7101 CAAAACTCTT CTCACTTGAC GTAATAGCCG ATACAAAATC TAGAGCAGCA 
7151 ACTTTTCTCT TTCTTCACTA AAGCTGCTAC GAAAGTATAG AAAAATCAAA 
7201 CGCTCAGAAC TTAGCTCTAT TTCAAGGTAC CATATATATT TCCTTATAAC 
7251 TGATGTTAAT TAACTCTAAA GGTGAAGAAT TATTCACTGG TGTTGTCCCA 
7301 ATTTTGGTTG AATTAGATGG TGATGTTAAT GGTCACAAAT TTTCTGTCTC 
7351 CGGTGAAGGT GAAGGTGATG CTACTTACGG TAAATTGACC TTAAAATTTA 
7401 TTTGTACTAC TGGTAAATTG CCAGTTCCAT GGCCAACCTT AGTCACTACT 
7451 TTCGGTTATG GTGTTCAATG TTTTGCGAGA TACCCAGATC ATATGAAACA 
7501 ACATGACTTT TTCAAGTCTG CCATGCCAGA AGGTTATGTT CAAGAAAGAA 
7551 CTATTTTTTT CAAAGATGAC GGTAACTACA AGACCAGAGC TGAAGTCAAG 
7601 TTTGAAGGTG ATACCTTAGT TAATAGAATC GAATTAAAAG GTATTGATTT 
7651 TAAAGAAGAT GGTAACATTT TAGGTCACAA ATTGGAATAC AACTATAACT 
7701 CTCACAATGT TTACATCATG GCTGACAAAC AAAAGAATGG TATCAAAGTT 
7751 AACTTCAAAA TTAGACACAA CATTGAAGAT GGTTCTGTTC AATTAGCTGA 
7801 CCATTATCAA CAAAATACTC CAATTGGTGA TGGTCCAGTC TTGTTACCAG 
7851 ACAACCATTA CTTATCCACT CAATCTGCCT TATCCAAAGA TCCAAACGAA 

7901 AAGAGAGACC ACATGGTCTT GTTAGAATTT GTTACTGCTG CTGGTATTAC 
7951 CCATGGTATG GATGAATTGT ACAAATAACT GCAGGGCGCG CCACTTCTAA 
8001 ATAAGCGAAT TTCTTATGAT TTATGATTTT TATTATTAAA TAAGTTATAA 
8051 AAAAAATAAG TGTATACAAA TTTTAAAGTG ACTCTTAGGT TTTAAAACGA 
8101 AAATTCTTAT TCTTGAGTAA CTCTTTCCTG TAGGTCAGGT TGCTTTCTCA 
8151 GGTATAGTAT GAGGTCGCTC TTATTGACCA CACCTCTACC GGCAGATCCG 
8201 CTAGGGATAA CAGGGTAATA TAGATCTGCC CGCCGGGAAG GCGAACCCGA 
8251 TCGGATGCAT CCTCTCTGCT GCCATGATGC TGAAGTTGTC GTTGAACATG 
8301 GTTGCTGCCG GCGAGGCGGT CGAGCAGGCA GTGCAGGAGG TGTTGGACTC 
8351 GGGAGTCAGA ACGGGCGACC TGCTCGGCTC GAGCTCGAAT TCATCGATGA 
8401 TATCAGATCC ACTAGTGGCC TATGCGGCCG CGGATCTGCC GGTCTCCCTA 
8451 TAGTGAGTCG TATTAATTTC GATAAGCCAG GTTAACCTGC ATTAATGAAT 
8501 CGGCCAACGC GCGGGGAGAG GCGGTTTGCG TATTGGGCGC TCTTCCGCTT 
8551 CCTCGCTCAC TGACTCGCTG CGCTCGGTCG TTCGGCTGCG GCGAGCGGTA 
8601 TCAGCTCACT CAAAGGCGGT AATACGGTTA TCCACAGAAT CAGGGGATAA 
8651 CGCAGGAAAG AACATGTGAG CAAAAGGCCA GCAAAAGGCC AGGAACCGTA 
8701 AAAAGGCCGC GTTGCTGGCG TTTTTCCATA GGCTCCGCCC CCCTGACGAG 
8751 CATCACAAAA ATCGACGCTC AAGTCAGAGG TGGCGAAACC CGACAGGACT 
8801 ATAAAGATAC CAGGCGTTTC CCCCTGGAAG CTCCCTCGTG CGCTCTCCTG 
8851 TTCCGACCCT GCCGCTTACC GGATACCTGT CCGCCTTTCT CCCTTCGGGA 
8901 AGCGTGGCGC TTTCTCAATG CTCACGCTGT AGGTATCTCA GTTCGGTGTA 
8951 GGTCGTTCGC TCCAAGCTGG GCTGTGTGCA CGAACCCCCC GTTCAGCCCG 
9001 ACCGCTGCGC CTTATCCGGT AACTATCGTC TTGAGTCCAA CCCGGTAAGA 
9051 CACGACTTAT CGCCACTGGC AGCAGCCACT GGTAACAGGA TTAGCAGAGC 
9101 GAGGTATGTA GGCGGTGCTA CAGAGTTCTT GAAGTGGTGG CCTAACTACG 
9151 GCTACACTAG AAGGACAGTA TTTGGTATCT GCGCTCTGCT GAAGCCAGTT 
9201 ACCTTCGGAA AAAGAGTTGG TAGCTCTTGA TCCGGCAAAC AAACCACCGC 
9251 TGGTAGCGGT GGTTTTTTTG TTTGCAAGCA GCAGATTACG CGCAGAAAAA 
9301 AAGGATCTCA AGAAGATCCT TTGATCTTTT CTACGGGGTC TGACGCTCAG 
9351 TGGAACGAAA ACTCACGTTA AGGGATTTTG GTCATGAGAT TATCAAAAAG 
9401 GATCTTCACC TAGATCCTTT TAAATTAAAA ATGAAGTTTT AAATCAATCT 
9451 AAAGTATATA TGAGTAAACT TGGTCTGACA GTTACCAATG CTTAATCAGT 
9501 GAGGCACCTA TCTCAGCGAT CTGTCTATTT CGTTCATCCA TAGTTGCCTG 
9551 ACTCCCCGTC GTGTAGATAA CTACGATACG GGAGGGCTTA CCATCTGGCC 
9601 CCAGTGCTGC AATGATACCG CGAGACCCAC GCTCACCGGC TCCAGATTTA 
9651 TCAGCAATAA ACCAGCCAGC CGGAAGGGCC GAGCGCAGAA GTGGTCCTGC 
9701 AACTTTATCC GCCTCCATCC AGTCTATTAA TTGTTGCCGG GAAGCTAGAG 
9751 TAAGTAGTTC GCCAGTTAAT AGTTTGCGCA ACGTTGTTGC CATTGCTACA 
9801 GGCATCGTGG TGTCACGCTC GTCGTTTGGT ATGGCTTCAT TCAGCTCCGG 
9851 TTCCCAACGA TCAAGGCGAG TTACATGATC CCCCATGTTG TGCAAAAAAG 
9901 CGGTTAGCTC CTTCGGTCCT CCGATCGTTG TCAGAAGTAA GTTGGCCGCA 
9951 GTGTTATCAC TCATGGTTAT GGCAGCACTG CATAATTCTC TTACTGTCAT 
10001 GCCATCCGTA AGATGCTTTT CTGTGACTGG TGAGTACTCA ACCAAGTCAT 
10051 TCTGAGAATA GTGTATGCGG CGACCGAGTT GCTCTTGCCC GGCGTCAATA 
10101 CGGGATAATA CCGCGCCACA TAGCAGAACT TTAAAAGTGC TCATCATTGG 
10151 AAAACGTTCT TCGGGGCGAA AACTCTCAAG GATCTTACCG CTGTTGAGAT 
10201 CCAGTTCGAT GTAACCCACT CGTGCACCCA ACTGATCTTC AGCATCTTTT 
10251 ACTTTCACCA GCGTTTCTGG GTGAGCAAAA ACAGGAAGGC AAAATGCCGC 
10301 AAAAAAGGGA ATAAGGGCGA CACGGAAATG TTGAATACTC ATACTCTTCC 
10351 TTTTTCAATA TTATTGAAGC ATTTATCAGG GTTATTGTCT CATGAGCGGA 
10401 TACATATTTG AATGTATTTA GAAAAATAAA CAAATAGGGG TTCCGCGCAC 



WO 2005/012533 



PCT/GB2004/003337 



61/61 



10451 ATTTCCCCGA AAAGTGCCAC CTGACGTCGA ATATCATTGA GAAGCTGCAT 
10501 TTTTTTTTTT TTTTTTTTTT TTTTTTTTTA TATATATTTC AAGGATATAC 
10551 CATTGTAATG TCTGCCCCTA AGAAGATCGT CGTTTTGCCA GGTGACCACG 
10601 TTGGTCAAGA AATCACAGCC GAAGCCATTA AGGTTCTTAA AGCTATTTCT 
10651 GATGTTCGTT CCAATGTCAA GTTCGATTTC GAAAATCATT TAATTGGTGG 
10701 TGCTGCTATC GATGCTACAG GTGTTCCACT TCCAGATGAG GCGCTGGAAG 
10751 CCTCCAAGAA GGCTGATGCC GTTTTGTTAG GTGCTGTGGG TGGTCCTAAA 
10801 TGGGGTACCG GTAGTGTTAG ACCTGAACAA GGTTTACTAA AAATCCGTAA 
10851 AGAACTTCAA TTGTACGCCA ACTTAAGACC ATGTAACTTT GCATCCGACT 
10901 CTCTTTTAGA CTTATCTCCA ATCAAGCCAC AATTTGCTAA AGGTACTGAC 
10951 TTCGTTGTTG TCAGAGAATT AGTGGGAGGT ATTTACTTTG GTAAGAGAAA 
11001 GGAAGACGAT GGTGATGGTG TCGCTTGGGA TAGTGAACAA TACACCGTTC 
11051 CAGAAGTGCA AAGAATCACA AGAATGGCCG CTTTCATGGC CCTACAACAT 
11101 GAGCCACCAT TGCCTATTTG GTCCTTGGAT AAAGCTAATG TTTTGGCCTC 
11151 TTCAAGATTA TGGAGAAAAA CTGTGGAGGA AACCATCAAG AACGAATTCC 
11201 CTACATTGAA GGTTCAACAT CAATTGATTG ATTCTGCCGC CATGATCCTA 
11251 GTTAAGAACC CAACCCACCT AAATGGTATT ATAATCACCA GCAACATGTT 
11301 TGGTGATATC ATCTCCGATG AAGCCTCCGT TATCCCAGGT TCCTTGGGTT 
11351 TGTTGCCATC TGCGTCCTTG GCCTCTTTGC CAGACAAGAA CACCGCATTT 
11401 GGTTTGTACG AACCATGCCA CGGTTCTGCT CCAGATTTGC CAAAGAATAA 
11451 GGTCAACCCT ATCGCCACTA TCTTGTCTGC TGCAATGATG TTGAAATTGT 
11501 CATTGAACTT GCCTGAAGAA GGTAAGGCCA TTGAAGATGC AGTTAAAAAG 
11551 GTTTTGGATG CAGGTATCAG AACTGGTGAT TTAGGTGGTT CCAACAGTAC 
11601 CACCGAAGTC GGTGATGCTG TCGCCGAAGA AGTTAAGAAA ATCCTTGCTT 
11651 AAAAAGATTC TCTTTTTTTA TGATATTTGT ACAAAAAAAA AAAAAAAAAA 
11701 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AATGCAGCGT CACATCGGAT 
11751 AATAATGACG TCTAAGAAAC CATTATTATC ATGACATTAA CCTATAAAAA 
11801 TAGGCGTATC ACGAGGCCCT TTCGTCTCGC GCGTTTCGGT GATGACGGTG 
11851 AAAACCTCTG ACACATGCAG CTCCCGGAGA CGGTCACAGC TTGTCTGTAA 
11901 GCGGATGCCG GGAGCAGACA AGCCCGTCAG GGCGCGTCAG CGGGTGTTGG 
11951 CGGGTGTCGG GGCTGGCTTA ACTATGCGGC ATCAGAGCAG ATTGTACTGA 
12001 GAGTGCACCA TATGGACATA TTGTCGTTAG AACGCGGCTA CAATTAATAC 
12051 ATAACCTTAT GTATCATACA CATACGATTT AGGTGACACT ATA 



